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AMENDMENT ACCOMPANYING SUBMISSION OF SEQUENCE LISTING 

Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box SEQUENCE 



In order to comply with the requirements for patent applications containing amino 
acid and/or sequence disclosures, please amend the application as follows: 

IN THE SPECIFICATION : 

At the end of the specification, please insert the printed Sequence Listing 
submitted concurrently herewith. 



Applicants submit this Amendment to insert the required references to 
SEQ ID NOS of the Sequence Listing filed concurrently herewith, and to indicate the 
insertion point for the Sequence Listing. Applicants respectfully request examination on 
the merits of this application. 



Sir: 



REMARKS 



Respectfully submitted, 
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In re patent application of 
MURDIN, Andrew D. et al. 

Serial No.: U.S. National Entry Group Art Unit: 1643 

ofPCT/CA99/00992 

Filed: October 28, 1999 Examiner: Not assigned 

For: CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND 
USES THEREOF 

STATEMENT TO SUPPORT FILING AND SUBMISSION IN 
ACCORDANCE with 37 C.F.R. §§ 1.821-1.825 



Assistant Commissioner for Patents 
Washington, D.C. 20231 
Box SEQUENCE 

Sir: 

In connection with a Sequence Listing submitted concurrently herewith, 
the undersigned hereby states that: 

1 . the submission, filed herewith in accordance with 37 C.F.R. § 
1.821(g), does not include new matter; and 

2. the content of the attached paper copy and the attached computer 
readable copy of the Sequence Listing, submitted in accordance with 37 C.F.R. § 
1.821(c) and (e), respectively, are the same. 

Respectfully submitted, 



>ate JJ Jo^D/Morrow 

Reg. No. 30,911 
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Art Unit: 
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THE UNITED STATES PATENT AND TRADEMARK OFFICi 
Andrew D. MURDIN 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS 
AND USES THEREOF 

To be assigned 

April 27, 2001 

Unassigned 

Unassigned 

PRELIMINARY AMENDMENT 



Commissioner for Patents 
Washington, D.C. 20231 



Prior to examination of the above-identified application, Applicants respectfully request that 
the following amendments be entered into the application: 



IN THE CLAIMS : 

Please cancel claims 1-24 in their entirety without prejudice or disclaimer and 
therefore insert new claims 25-63. 



15 25. (New) A nucleic acid molecule comprising a nucleic acid sequence which encodes a 

polypeptide selected from any of: 

(a) SEQ ID Nos: 27 to 45; 

(b) an immunogenic fragment comprising at least 12 consecutive amino acids from a 
polypeptide of (a); and 

2 0 (c) a polypeptide of (a) or (b) which has been modified without loss of 

immunogenicity, wherein said modified polypeptide is at least 75% identical in amino acid sequence 
to the corresponding polypeptide of (a) or (b). 

-1- 
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26. (New) A nucleic acid molecule comprising a nucleic acid sequence selected from any 
of: 

(a) SEQ ID Nos: 1 to 26; 

5 (b) a sequence which encodes a polypeptide encoded by any one of SEQ ID Nos: 1 to 

26; 

(c) a sequence comprising at least 38 consecutive nucleotides from any one of the 
nucleic acid sequences of (a) and (b); and 

(d) a sequence which encodes a polypeptide which is at least 75 % identical in amino 
10 acid sequence to any one of the polypeptides encoded by SEQ ID Nos: 1 to 26. 

27. (New) A nucleic acid molecule comprising a nucleic acid sequence which is anti- 
sense to the nucleic acid molecule of claim 25. 

28. (New) A nucleic acid molecule comprising a nucleic acid sequence which encodes a 
fusion protein, said fusion protein comprising a polypeptide encoded by a nucleic acid molecule 

15 according to claim 25 and a second polypeptide. 

29. (New) The nucleic acid molecule of claim 28 wherein the second polypeptide is a 
heterologous signal peptide. 

30. (New) The nucleic acid molecule of claim 28 wherein the second polypeptide has 
adjuvant activity. 

20 31. (New) A nucleic acid molecule according to claim 25 , operatively linked to one or 

more expression control sequences. 

32. (New) A vaccine comprising a vaccine vector and at least one first nucleic acid 

selected from any of: 

(i) SEQ ID Nos: 1 to 26; 

2 5 (ii) a nucleic acid sequence which encodes a polypeptide encoded by any one of SEQ 

ID Nos: 1 to 26; 

-2- 
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(iii) a nucleic acid sequence comprising at least 38 consecutive nucleotides from any 
one of the nucleic acid sequences of (i) and (ii); 



(iv) a nucleic acid sequence which encodes a polypeptide which is at least 75 % 
identical in amino acid sequence to the polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

5 (v) a nucleic acid sequence which encodes a polypeptide whose sequence is set forth 

in any one of SEQ ID Nos: 27 to 45; 

(vi) a nucleic acid sequence which encodes an immunogenic fragment comprising at 
least 12 consecutive amino acids from any one of SEQ ID Nos: 27 to 45; and 

(vii) a nucleic acid sequence which encodes a polypeptide as defined in (v) or an 

1 0 immunogenic fragment as defined in (vi) which has been modified without loss of immunogenicity, 
wherein said modified polypeptide or fragment is at least 75 % identical in amino acid sequence to the 
corresponding polypeptide of (v) or the corresponding fragment of (vi); 

wherein each first nucleic acid is capable of being expressed. 

33. (New) A vaccine comprising a vaccine vector and at least one first nucleic acid 

15 encoding a fusion protein, wherein the fusion protein comprises: 

(a) a first polypeptide selected from any of: 

(i) a polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(ii) a polypeptide encoded by a nucleic acid sequence comprising at least 38 
consecutive nucleotides from any one of SEQ ID Nos: 1 to 26; 

2 0 (iii) a polypeptide which is at least 75 % identical in amino acid sequence to the 

polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(iv) a polypeptide whose sequence is set forth in any one of SEQ ID Nos: 27 to 45; 

(v) an immunogenic fragment comprising at least 12 consecutive amino acids from 
any one of SEQ ID Nos: 27 to 45; and 

2 5 (vi) a polypeptide as defined (iv) or an immunogenic fragment as defined in (v) 

which has been modified without loss of immunogenicity, wherein said modified polypeptide or 
fragment is at least 75 % identical in amino acid sequence to the corresponding polypeptide of (iv) or 
the corresponding fragment of (v); and 

-3- 
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wherein each first nucleic acid is capable of being expressed. 

34 . (New) The vaccine of claim 33 wherein the second polypeptide is a heterologous 

signal peptide. 

5 35. (New) The vaccine of claim 33 wherein the second polypeptide has adjuvant activity. 

36. (New) The vaccine of claim 32 wherein each first nucleic acid is operatively linked to 
one or more expression control sequences. 

37. (New) A vaccine according to claim 32 wherein each first nucleic acid is expressed as 
a polypeptide, and wherein the vaccine comprises a second nucleic acid encoding an additional 

1 0 polypeptide which enhances the immune response to the polypeptide expressed by the first nucleic 
acid. 

38. (New) The vaccine of claim 37 wherein the second nucleic acid encodes an additional 
Chlamydia polypeptide. 

39. (New) A pharmaceutical composition comprising a nucleic acid according to claim 25 
15 and a pharmaceutical^ acceptable carrier. 

40. (New) A pharmaceutical composition comprising a vaccine according to claim 32 and 
a pharmaceutically acceptable carrier. 

41 . (New) A unicellular host transformed with the nucleic acid molecule of claim 31 . 

42. (New) An isolated nucleic acid probe of 5 to 100 nucleotides which hybridizes under 
2 0 stringent conditions to any one of nucleic acid molecules of SEQ ID Nos: 1 to 26, or to a 

complementary or anti-sense sequence of said nucleic acid molecule. 

43. (New) A primer of 10 to 40 nucleotides which hybridizes under stringent conditions 
to any one of nucleic acid molecules of SEQ ID Nos: 1 to 26, or to a homolog or complementary or 
anti-sense sequence of said nucleic acid molecule. 



25 



44. 



(New) A polypeptide encoded by a nucleic acid sequence according to claim 26. 
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45. (New) A polypeptide comprising an amino acid sequence selected from any of: 

(a) SEQ ID Nos: 27 to 45; 

(b) an immunogenic fragment comprising at least 12 consecutive amino acids from a 
polypeptide of (a); and 

5 (c) a polypeptide of (a) or (b) which has been modified without loss of 

immunogenic ity, wherein said modified polypeptide is at least 75% identical in amino acid sequence 
to the corresponding polypeptide of (a) or (b). 

46. (New) A fusion protein comprising a polypeptide of claim 44 and a second 
polypeptide. 

10 47. (New) The fusion protein of claim 46 wherein the second polypeptide is a 

heterologous signal peptide. 

48. (New) The fusion protein of claim 46 wherein the second polypeptide has adjuvant 
activity. 

49. (New) A method for producing a polypeptide, comprising the step of culturing a 
15 unicellular host of claim 41 and recovering the resultant polypeptide. 

50. (New) An antibody against the polypeptide of claim 44. 

51. (New) A vaccine comprising at least one first polypeptide selected from any of: 

(i) a polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(ii) a polypeptide encoded by a nucleic acid sequence comprising at least 38 
2 0 consecutive nucleotides from any one of SEQ ID Nos: 1 to 26; 

(iii) a polypeptide which is at least 75 % identical in amino acid sequence to the 
polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(iv) a polypeptide whose sequence is set forth in any one of SEQ ID Nos: 27 to 45; 

(v) an immunogenic fragment comprising at least 12 consecutive amino acids from 
2 5 any one of SEQ ID Nos: 27 to 45; and 

(vi) a polypeptide as defined in (iv) or an immunogenic fragment as defined in (v) 
which has been modified without loss of immunogenic ity, wherein said modified polypeptide or 

-5- 
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fragment is at least 75 % identical in amino acid sequence to the corresponding polypeptide of (iv) or 
the corresponding fragment of (v). 

52. (New) A vaccine comprising at least one fusion protein, wherein the fusion protein 
comprises: 

5 (a) a first polypeptide selected from any of: 

(i) a polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(ii) a polypeptide encoded by a nucleic acid sequence comprising at least 38 
consecutive nucleotides from any one of SEQ ID Nos: 1 to 26; 

(iii) a polypeptide which is at least 75 % identical in amino acid sequence to the 
10 polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(iv) a polypeptide whose sequence is set forth in any one of SEQ ID Nos: 27 to 45; 

(v) an immunogenic fragment comprising at least 12 consecutive amino acids from 
any one of SEQ ID Nos: 27 to 45; and 

(vi) a polypeptide as defined (iv) or an immunogenic fragment as defined in (v) 
15 which has been modified without loss of immunogenic ity, wherein said modified polypeptide or 

fragment is at least 75 % identical in amino acid sequence to the corresponding polypeptide of (iv) or 
the corresponding fragment of (v); and 

(b) a second polypeptide. 

53. (New) The vaccine of claim 52 wherein the second polypeptide is a heterologous 
2 0 signal peptide. 

54. (New) The vaccine of claim 52 wherein the second polypeptide has adjuvant activity. 

55. (New) A vaccine comprising at least one first polypeptide according to claim 44 and 
an additional polypeptide which enhances the immune response to the first polypeptide. 

56. (New) The vaccine of claim 55 wherein the additional polypeptide comprises a 
2 5 Chlamydia polypeptide. 

57. (New) A pharmaceutical composition comprising a polypeptide according to claim 44 
and a pharmaceutically acceptable carrier. 

-6- 
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58 ■ (New) A pharmaceutical composition comprising a vaccine according to claim 51 and 

a pharmaceutically acceptable carrier. 



59 - (New) A pharmaceutical composition comprising an antibody according to claim 50 

and a pharmaceutically acceptable carrier. 

60 ■ (New) A method for preventing or treating Chlamydia infection comprising 

administering to a patient an effective amount of: 

(a) a nucleic acid molecule according to claim 26; or 

(b) a vaccine comprising a vaccine vector and at least one first nucleic acid according 

to claim 26; or 

(c) a pharmaceutical composition comprising a nucleic acid according to claim 26 and 
a pharmaceutically acceptable carrier; or 

(d) a polypeptide encoded by a nucleic acid sequence according to claim 26; or 

(e) an antibody against a polypeptide encoded by a nucleic acid sequence according to 

claim 26. 

61 • (New) A method of detecting Chlamydia infection comprising the step of contacting a 
body fluid of a mammal to be tested, with a component selected from any one of: 

(a) a nucleic acid molecule according to claim 26; 

(b) a polypeptide encoded by a nucleic acid sequence according to claim 26; and 

(c) an antibody against a polypeptide encoded by a nucleic acid sequence according to 

claim 26. 

62 • (New) A diagnostic kit comprising instructions for use and a component selected from 
any one of: 

(a) a nucleic acid molecule according to claim 26; 

(b) a polypeptide encoded by a nucleic acid sequence according to claim 26; and 

(c) an antibody against a polypeptide encoded by a nucleic acid sequence according to claim 26. 

63. (New) A method for identifying a polypeptide of claim 44 which induces an immune 

response effective to prevent or lessen the severity of Chlamydia infection in a mammal previously 
immunized with polypeptide, comprising the steps of: 
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(a) immunizing a mouse with a polypeptide of claim 44; and 

(b) inoculating the immunized mouse with Chlamydia; 

wherein the polypeptide which prevents or lessens the severity of Chlamydia infection 
in the immunized mouse compared to a non-immunized control mouse is identified. 
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REMARKS 



The Examiner is respectfully requested to enter the above amendment prior to 
examination of the instant application. 

Respectfully submitted, 



Date April 27, 2001 



FOLEY & LARDNER Bernhard D. Saxe 

Washington Harbour Attorney for Applicant 

3000 K Street, N. W. , Suite 500 Registration No. 28,665 

Washington, D.C. 20007-5109 
Telephone: (202) 672-5414 
Facsimile: (202) 672-5399 
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TITLE OF INVENTION 

CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES 
THEREOF 

5 REFERENCE TO RELATED APPLICATIONS 



Provisional Application No. 60/106034, filed October 28, 1998, 
U.S. Provisional Application No . 60/106039, filed October 28, 
1998, U.S. Provisional Application No. 60/106042, filed October 

10 28, 1998, U.S. Provisional Application No. 60/106044, filed 
October 28, 1998, U.S. Provisional Application No. 60/106072, 
filed October 29, 1998, U.S. Provisional Application No. 
60/106073, filed October 29, 1998, U.S. Provisional Application 
No. 60/106074, filed October 29, 1998,U.S. Provisional 

15 Application No. 60/106087, filed October 29, 1998, U.S. 

Provisional Application No. 60/106587, filed November 2, 1998, 
U.S. Provisional Application No. 60/106588, filed November 2, 
1998, U.S. Provisional Application No. 60/107089, filed November 
2, 1998, U.S. Provisional Application No. 60/107034, filed 

20 November 2, 1998 and U.S. Provisional Application No. 60/107035, 
filed November 2, 1998. 

FIELD OF INVENTION 

The present invention relates to Chlamydia antigens 
25 and corresponding DNA molecules, which can be used to prevent 
and treat Chlamydia infection in mammals, such as humans. 

BACKGROUND OF THE INVENTION 



30 and structural similarities to gram-negative bacteria including 
a trilaminar outer membrane, which contains lipopolysaccharide 
and several membrane proteins that are structurally and 
functionally analogous to proteins found in E coll. They are 
obligate intra-cellular parasites with a unique biphasic life 



This application claims the benefit of U.S. 



Chlamydiae are prokaryotes. They exhibit morphologic 
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cycle consisting of a metabolically inactive but infectious 
extracellular stage and a replicating but non-infectious 
intracellular stage. The replicative stage of the life-cycle 
takes place within a membrane-bound inclusion which sequesters 
5 the bacteria away from the cytoplasm of the infected host cell. 

C. pneumoniae is a common human pathogen, originally 
described as the TWAR strain of Chlamydia pslttacl but 
subsequently recognised to be a new species. C. pneumoniae is 
antigenically, genetically and morphologically distinct from 

10 other chlamydia species (C. trachomatis , C. pecorum and C. 
psittacl) . It shows 10% or less DNA sequence homology with 
either of C . trachomatis or C. psittacl. 

C. pneumoniae is a common cause of community acquired 
pneumonia, only less frequent than Streptococcus pneumoniae and 

15 Mycoplasma pneumoniae (Grayston et al. (1995) Journal of 

Infectious Diseases 168:1231; Campos et al. (1995) Investigation 
of Ophthalmology and Visual Science 36:1477). It can also cause 
upper respiratory tract symptoms and disease, including 
bronchitis and sinusitis (Grayston et al. (1995) Journal of 

20 Infectious Diseases 168:1231; Grayston et al (1990) Journal of 
Infectious Diseases 161:618; Marrie (1993) Clinical Infectious 
Diseases. 18:501; Wang et al (1986) Chlamydial infections). 
Cambridge University Press, Cambridge, p. 329The great majority 
of the adult population (over 60%) has antibodies to C. 

25 pneumoniae (Wang et al (1986) Chlamydial infections. Cambridge 
University Press, Cambridge, p. 329) , indicating past infection 
which was unrecognized or asymptomatic. 

C. pneumoniae infection usually presents as an acute 
respiratory disease (i.e., cough, sore throat, hoarseness, and 

30 fever; abnormal chest sounds on auscultation) . For most 

patients, the cough persists for 2 to 6 weeks, and recovery is 
slow. In approximately 10% of these cases, upper respiratory 
tract infection is followed by bronchitis or pneumonia. 
Furthermore, during a C. pneumoniae epidemic, subsequent 
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co-infection with pneumococcus has been noted in about half of 
these pneumonia patients, particularly in the infirm and the 
elderly. As noted above, there is more and more evidence that 
C. pneumoniae infection is also linked to diseases other than 
5 respiratory infections . 

The reservoir for the organism is presumably people. 
In contrast to C. psittaci infections, there is no known bird or 
animal reservoir. Transmission has not been clearly defined. It 
may result from direct contact with secretions, from fomites, or 

10 from airborne spread. There is a long incubation period, which 
may last for many months. Based on analysis of epidemics, C. 
pneumoniae appears to spread slowly through a population (case- 
to-case interval averaging 30 days) because infected persons are 
inefficient transmitters of the organism. Susceptibility to C. 

15 pneumoniae is universal. Reinfections occur during adulthood, 
following the primary infection as a child. C. pneumoniae 
appears to be an endemic disease throughout the world, 
noteworthy for superimposed intervals of increased incidence 
(epidemics) that persist for 2 to 3 years. C . trachomatis 

20 infection does not confer cross-immunity to C. pneumoniae. 
Infections are easily treated with oral antibiotics, 
tetracycline or erythromycin (2 g/d, for at least 10 to 14 d) . A 
recently developed drug, azithromycin, is highly effective as a 
single-dose therapy against chlamydial infections. 

25 In most instances, C. pneumoniae infection is often 

mild and without complications, and up to 90% of infections are 
subacute or unrecognized. Among children in industrialized 
countries, infections have been thought to be rare up to the age 
of 5 y, although a recent study (E Normann et al, Chlamydia 

30 pneumoniae in children with acute respiratory tract infections, 
Acta Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported 
that many children in this age group show PCR evidence of 
infection despite being seronegative, and estimates a prevalence 
of 17-19% in 2-4 y olds. In developing countries, the 
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seroprevalence of C. pneumoniae antibodies among young children 
is elevated, and there are suspicions that C. pneumoniae may be 
an important cause of acute lower respiratory tract disease and 
mortality for infants and children in tropical regions of the 
5 world. 

From seroprevalence studies and studies of local 
epidemics, the initial C. pneumoniae infection usually happens 
between the ages of 5 and 20 y. In the USA, for example, there 
are estimated to be 30,000 cases of childhood pneumonia each 

10 year caused by C. pneumoniae. Infections may cluster among 
groups of children or young adults (e.g., school pupils or 
military conscripts) . 

C. pneumoniae causes 10 to 25% of community-acquired 
lower respiratory tract infections (as reported from Sweden, 

15 Italy, Finland, and the USA) . During an epidemic, C. pneumonia 
infection may account for 50 to 60% of the cases of pneumonia. 
During these periods, also, more episodes of mixed infections 
with S. pneumoniae have been reported. 
Reinfection during adulthood is common; the clinical 

20 presentation tends to be milder. Based on population 

seroprevalence studies, there tends to be increased exposure 
with age, which is particularly evident among men. Some 
investigators have speculated that a persistent, asymptomatic C. 
pneumoniae infection state is common. 

25 In adults of middle age or older, C. pneumoniae 

infection may progress to chronic bronchitis and sinusitis. A 
study in the USA revealed that the incidence of pneumonia caused 
by C. pneumoniae in persons younger than 60 years is 1 case per 
1,000 persons per year; but in the elderly, the disease 

30 incidence rose three-fold. C. pneumoniae infection rarely leads 
to hospitalization, except in patients with an underlying 
illness. 

Of considerable importance is the association of 
atherosclerosis and C. pneumoniae infection. There are several 
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epidemiological studies showing a correlation of previous 
infections with C. pneumoniae and heart attacks, coronary artery 
and carotid artery disease (Saikku et al. (1988) Lancet ; ii : 983 ; 
Thorn et al. (1992) JAMA 268:68; Linnanmaki et al. (1993), 
5 Circulation 87:1030; Saikku et al. (1992) Annals Internal 
Medicine 116:273; Melnick et al(1993) American Journal of 
Medicine 95:499). Moreover, the organisms has been detected in 
atheromas and fatty streaks of the coronary, carotid, peripheral 
arteries and aorta (Shor et al. (1992) South African. Medical 
10 Journal 82:158; Kuo et al. (1993) Journal of Infectious Diseases 
167:841; Kuo et al. (1993) Arteriosclerosis and Thrombosis 
13:1500; Campbell et al (1995) Journal of Infectious Diseases 
172:585; Chiu et al. Circulation, 1997 (In Press)). Viable C. 
pneumoniae has been recovered from the coronary and carotid 
15 artery (Ramirez et al (1996) Annals of Internal Medicine 

125:979; Jackson et al. Abst. K121, p272, 36 th ICAAC, 15-18 
Sept. 1996, New Orleans). Furthermore, it has been shown that 
C. pneumoniae can induce changes of atherosclerosis in a rabbit 
model (Fong et al (1997) Journal of Clinical Microbiolology 
20 35:48). Taken together, these results indicate that it is 

highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial 
atherosclerosis remains to be demonstrated. 

A number of recent studies have also indicated an 
25 association between C. pneumoniae infection and asthma. 

Infection has been linked to wheezing, asthmatic bronchitis, 
adult-onset asthma and acute exacerbations of asthma in adults, 
and small-scale studies have shown that prolonged antibiotic 
treatment was effective at greatly reducing the severity of the 
30 disease in some individuals (Hahn DL, et al. Evidence for 
Chlamydia pneumoniae infection in steroid-dependent asthma. 
Ann Allergy Asthma Immunol. 1998 Jan; 80(1): 45-49.; Hahn DL, et 
al. Association of Chlamydia pneumoniae IgA antibodies with 
recently symptomatic asthma. Epidemiol Infect. 1996 Dec; 
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117(3): 513-517; Bjornsson E, et al. Serology of chlamydia in 
relation to asthma and bronchial hyperresponsiveness . Scand J 
Infect Dis. 1996; 28(1): 63-69.; Hahn DL . Treatment of Chlamydia 
pneumoniae infection in adult asthma: a before-after trial. J 
5 Fam Pract . 1995 Oct; 41(4): 345-351.; Allegra L, et al. Acute 
exacerbations of asthma in adults: role of Chlamydia pneumoniae 
infection. Eur Respir J. 1994 Dec; 7(12): 2165-2168.; Hahn DL, 
et al. Association of Chlamydia pneumoniae (strain TWAR) 
infection with wheezing, asthmatic bronchitis, and adult-onset 

10 asthma. JAMA. 1991 Jul 10; 266(2): 225-230). 

In light of these results a protective vaccine against 
C. pneumoniae infection would be of considerable importance. 
There is not yet an effective vaccine for any human chlamydial 
infection. It is conceivable that an effective vaccine can be 

15 developed using physically or chemically inactivated Chlamydiae. 
However, such a vaccine does not have a high margin of safety. 
In general, safer vaccines are made by genetically manipulating 
the organism by attenuation or by recombinant means. 
Accordingly, a major obstacle in creating an effective and safe 

20 vaccine against human chlamydial infection has been the paucity 
of genetic information regarding Chlamydia, specifically C. 
pneumoniae . 

Studies with C. trachomatis and C. psittaci indicate 
that safe and effective vaccine against Chlamydia is an 

25 attainable goal. For example, mice which have recovered from a 
lung infection with C. trachomatis are protected from 
infertility induced by a subsequent vaginal challenge (Pal et 
al. (1996) Infection and Immunity . 64 : 534 1 ) . Similarly, sheep 
immunized with inactivated C. psittaci were protected from 

30 subsequent chlamydial-induced abortions and stillbirths (Jones 
et al. (1995) Vaccine 13:715). Protection from chlamydial 
infections has been associated with Thl immune responses, 
particularly the induction of INFg - producing CD4+T-cells 
(Igietsemes et al. (1993) Immunology 5:317). The adoptive 
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transfer of CD4 + cell lines or clones to nude or SCID mice 
conferred protection from challenge or cleared chronic disease 
(Igietseme et al (1993) Regional Immunology 5:317; Magee et al 
(1993) Regional Immunology 5: 305), and in vivo depletion of 
5 CD4 + T cells exacerbated disease post-challenge (Landers et al 
(1991) Infection & Immunity 59:3774; Magee et al (1995) 
Infection & Immunity 63:516). However, the presence of 
sufficiently high titres of neutralising antibody at mucosal 
surfaces can also exert a protective effect (Cotter et al. 

10 (1995) Infection and Immunity 63:4704). 

Antigenic variation within the species C. pneumoniae 
is not well documented due to insufficient genetic information, 
though variation is expected to exist based on C. trachomatis . 
Serovars of C. trachomatis are defined on the basis of antigenic 

15 variation in MOMP, but published C. pneumoniae MOMP gene 

sequences show no variation between several diverse isolates of 
the organism (Campbell et al (1990) Infection and Immunity 
58:93; McCafferty et al (1995) Infection and Immunity 63:2387-9; 
Knudsen et ai (1996) Third Meeting of the European Society for 

20 Chlamydia Research, Vienna) . Regions of the protein known to be 
conserved in other chlamydial MOMPs are conserved in C. 
pneumoniae (Campbell et al (1990) Infection and Immunity 58:93; 
McCafferty et al (1995) Infection and Immunity 63:2387-9). One 
study has described a strain of C. pneumoniae with a MOMP of 

25 greater that usual molecular weight, but the gene for this has 
not been sequenced (Grayston et al. (1995) Journal of Infectious 
Diseases 168:1231). Partial sequences of outer membrane protein 
2 from nine diverse isolates were also found to be invariant 
(Ramirez et al (1996) Annals of Internal Medicine 125:979). The 

30 genes for HSP60 and HSP70 show little variation from other 

chlamydial species, as would be expected. The gene encoding a 
76kDa antigen has been cloned from a single strain of C. 
pneumoniae. It has no significant similarity with other known 
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chlamydial genes (Marrie (1993) Clinical Infectious Diseases. 
18:501) . 



pneumoniae are conserved across all Chl&mydlae, but 98kDa/ 
5 7 6kDa and 54kDa proteins appear to be C. pneumonia e-sp&cit ic 
(Campos et al. (1995) Investigation of Ophthalmology and Visual 
Science 36:1477; Marrie (1993) Clinical Infectious Diseases. 
16:501,- Wiedmann-Al-Ahmad M, et al. Reactions of polyclonal and 
neutralizing anti-p54 monoclonal antibodies with an isolated, 
10 Species-specific 54-Kliodal ton protein of Chlaraydi-a pneumoni-8 . 
Clin Diagn Lab Immunol. 1997 Nov; 4(6): 700-704). A 
publication relevant to 98KDa proteins is Perez Melgosa et al. 
FEMS Microbiology Letters. 112(2): 199-204. 1993. 



15 does show variation of blotting patterns between isolates, 

indicating that serotypes C. pneumoniae may exist (Ref 1,16). 
However, the results are potentially confounded by the 
infection status of the patients, since immunoblot profiles of 
a patient's sera change with time post-infection. An 

2 0 assessment of the number and relative frequency of any 

serotypes, and the defining antigens, is not yet possible. 



Isolating polynucleotide sequences of C. pneumoniae for use in 
preventing and treating Chlamydia infection. 

25 SUMMARY OF THE INVENTION 



polynucleotide molecules that encode Chlamydia polypeptides 
which can be used in methods to prevent, treat, and diagnose 
Chlamydia infection. In one form of the invention, the 
30 polynucleotide molecules are selected from DNA that encode 

polypeptides CPN100397 (SEQ ID Nos: 1 and 2), CPN10042I (SEQ ID 



Many antigens recognised by immune sera to C. 



Immunoblotting of isol^rnsR with sera from patients 



Accordingly, a need exists for identifying and 



The present invention provides purified and isolated 
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Nos: 3 and 4), CPN100422 (SEQ ID Nos: 5 and 6), CPN100424 (SEQ 
ID Nos: 7 and 8), CPN100426 <SEQ ID Nos: 9 and 10), CFN100506 
(SEQ ID Nos: 11 and 12), CPN100515 (SEQ ID Nos: 13 and 14), 
CPN100538 (SEQ ID Nos: 15 and 16), CPN100557 (SEQ ID Nos: 17 
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18), CPN100622 (SEQ ID Nos : 19 and 20), CPN100626 (SEQ ID Nos : 
21 and, 22), CPN100628 (SEQ ID Nos: 23 and 24) and CPN100630 
(SEQ ID Nos: 25 and 26). 

Another form of the invention provides polypeptides 
5 corresponding to the isolated DNA molecules. The amino acid 
sequences of the corresponding encoded polypeptides are shown 
for CPN100397 as SEQ ID Nos: 27 and 28, CPN100421 as SEQ ID No: 
29, CPN100422 as SEQ ID No: 30, CPN100424 as SEQ ID No: 31, 
CPN100426 as SEQ ID No: 32, CPN100508 as SEQ ID Nos: 33 and 34, 

10 CPN100515 as SEQ ID Nos: 35 and 36, CPN100538 as SEQ ID No: 37, 
CPN100557 as SEQ ID Nos: 38 and 39, CPN100622 as SEQ ID Nos: 40 
and 41, CPN100626 as SEQ ID No: 42, CPN100628 as SEQ ID No: 43 
and CPN100630 as SEQ ID Nos: 44 and 45. 

Those skilled in the art will readily understand that the 

15 invention, having provided the polynucleotide sequences encoding 
Chlamydia polypeptides, also provides polynucleotides encoding 
fragments derived from such peptides. Moreover, the invention 
is understood to provide mutants and derivatives of such 
polypeptides and fragments derived therefrom, which result from 

20 the addition, deletion, or substitution of non-essential amino 
acids as described herein. Those skilled in the art would also 
readily understand that the invention, having provided the 
polynucleotide sequences encoding Chlamydia polypeptides, 
further provides monospecific antibodies that specifically bind 

25 to such polypeptides 

The present invention has wide application and includes 
expression cassettes, vectors, and cells transformed or 
transfected with the polynucleotides of the invention. 
Accordingly, the present invention further provides (i) a method 

30 for producing a polypeptide of the invention in a recombinant 
host system and related expression cassettes, vectors, and 
transformed or transfected cells; (ii) a vaccine, or a live 
vaccine vector such as a pox virus, Salmonella typhimurlum, or 
Vijbrio cholerae vector, containing a polynucleotide of the 
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invention, such vaccines and vaccine vectors being useful for, 
e.g., preventing and treating Chlamydia infection, in 
combination with a diluent or carrier, and related 
pharmaceutical compositions and associated therapeutic and/or 
5 prophylactic methods; (iii) a therapeutic and/or prophylactic 
use of an RNA or DNA molecule of the invention, either in a 
naked form or formulated with a delivery vehicle, a polypeptide 
or combination of polypeptides, or a monospecific antibody of 
the invention, and related pharmaceutical compositions; (iv) a 
10 method for diagnosing the presence of Chlamydia in a biological 
sample, which can involve the use of a DNA or RNA molecule, a 
monospecific antibody, or a polypeptide of the invention; and 
(v) a method for purifying a polypeptide of the invention by 
antibody-based affinity chromatography. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be further understood from the 
following description with reference to the drawings, in which: 

Figure 1 shows the nucleotide sequence of the CPN100397 
20 (SEQ ID No: 1 - entire sequence and SEQ ID No: 2 - coding 

sequence) and the deduced amino acid sequence of the CPN100397 
protein from Chlamydia pneumoniae (SEQ ID No: 27 and 28). 

Figure 2 shows the restriction enzyme analysis of the 
g ene encoding the C. pneumoniae CPN100397 gene. 
25 Figure 3 shows the nucleotide sequence of the CPN100421 

(SEQ ID No: 3 - entire sequence and SEQ ID No: 4 - coding 
sequence) and the deduced amino acid sequence of the CPN100421 
protein from Chlamydia pneumoniae (SEQ ID No: 29) . 

Figure 4 shows the restriction enzyme analysis of the 
30 gene encoding the C. pneumoniae CPN100421 gene. 

Figure 5 shows the nucleotide sequence of the CPN100422 
(SEQ ID No: 5 - entire sequence and SEQ ID No: 6 - coding 
sequence) and the deduced amino acid sequence of the CPN100422 
protein from Chlamydia pneumoniae (SEQ ID No: 30) . 
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Figure 6 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100422 gene. 

Figure 7 shows the nucleotide sequence of the CPN10O424 
(SEQ ID No: 7 - entire sequence and SEQ ID No: 8 - coding 
5 sequence) and the deduced amino acid sequence of the CPN100424 
protein from Chlamydia pneumoniae (SEQ ID No: 31) . 

Figure 8 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100424 gene. 

Figure 9 shows the nucleotide sequence of the CPN100426 
10 (SEQ ID No: 9 - entire sequence and SEQ ID No: 10 - coding 

sequence) and the deduced amino acid sequence of the CPN100426 
protein from Chlamydia pneumoniae (SEQ ID No: 32) . 

Figure 10 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100426 gene. 
15 Figure 11 shows the nucleotide sequence of the CPN100508 

(SEQ ID No: 11 - entire sequence and SEQ ID No: 12 - coding 
sequence) and the deduced amino acid sequence of the 
CPN100508protein from Chlamydia pneumoniae (SEQ ID No: 33 - full 
length sequence and SEQ ID No: 34 - processed sequence) . 
20 Figure 12 shows the restriction enzyme analysis of the 

gene encoding the C. pneumoniae CPN100508 gene. 

Figure 13 shows the nucleotide sequence of the CPN100515 
(SEQ ID No: 13 - entire sequence and SEQ ID No: 14 - coding 
sequence) and the deduced amino acid sequence of the CPN100515 
25 protein from Chlamydia pneumoniae (SEQ ID No: 35 - full length 
sequence and SEQ ID No: 36 - processed sequence) . 

Figure 14 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100515 gene. 

Figure 15 shows the nucleotide sequence of the CPN100538 
30 (SEQ ID No: 15 - entire sequence and SEQ ID No: 16 - coding 
sequence) and the deduced amino acid sequence of the CPN100538 
protein from Chlamydia pneumoniae (SEQ ID No: 37). 

Figure 16 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100538 gene. 
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Figure 17 shows the nucleotide sequence of the CPN100557 
(SEQ ID No: 17 - entire sequence and SEQ ID No: 18 - coding 
sequence) and the deduced amino acid sequence of the CPN100557 
protein from Chlamydia pneumoniae (SEQ ID No: 38 - full length 
5 sequence and SEQ ID No: 39 - processed sequence). 

Figure 18 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100557 gene. 

Figure 19 shows the nucleotide sequence of the CPN100622 
(SEQ ID No: 19 - entire sequence and SEQ ID No: 20 - coding 
10 sequence) and the deduced amino acid sequence of the CPN100622 
protein from Chlamydia pneumoniae (SEQ ID No: 40 - full length 
sequence and SEQ ID No: 41 - processed sequence) . 

Figure 20 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100622 gene. 
15 Figure 21 shows the nucleotide sequence of the CPN100626 

(SEQ ID No: 21 - entire sequence and SEQ ID No: 22 - coding 
sequence) and the deduced amino acid sequence of the CPN100626 
protein from Chlamydia pneumoniae (SEQ ID No: 42) . 

Figure 22 shows the restriction enzyme analysis of the 
20 gene encoding the C. pneumoniae CPN100626 gene. 

Figure 23 shows the nucleotide sequence of the CPN100628 
(SEQ ID No: 23 - entire sequence and SEQ ID No: 24 - coding 
sequence) and the deduced amino acid sequence of the CPN100628 
protein from Chlamydia pneumoniae (SEQ ID No: 4 3) . 
25 Figure 24 shows the restriction enzyme analysis of the 

gene encoding the C. pneumoniae CPN100628 gene. 

Figure 25 shows the nucleotide sequence of the CPN100630 
(SEQ ID No: 25 - entire sequence and SEQ ID No: 26 - coding 
sequence) and the deduced amino acid sequence of the CPN100630 
30 protein from Chlamydia pneumoniae (SEQ ID No: 4 4 - full length 
sequence and SEQ ID No: 45 - processed sequence). 

Figure 26 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100630 gene. 
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Figures 27 through 39 show an identification of T and B 
cell epitopes from the amino acid sequences shown in the 
foregoing figures. 

5 DETAILED DESCRIPTION OF INVENTION 

Open reading frames (ORFs) encoding chlamydial 
polypeptides have been identified from the C. pneumoniae genome. 
These polypeptides include polypeptides found permanently in 
the bacterial membrane structure, polypeptides present in the 

10 external vicinity of the bacterial membrane, polypeptides found 
permanently in the inclusion membrane structure, polypeptides 
present in the external vicinity of the inclusion membrane, and 
polypeptides released into the cytoplasm of the infected cell. 
These polypeptides can be used to prevent and treat Chlamydia 

15 infection. 

According to a first aspect of the invention, isolated 
polynucleotides are provided which encode the precursor and 
mature forms of Chlamydia polypeptides, whose amino acid 
sequences are selected from the group consisting of: SEQ ID 

20 Nos: 27 to 45. 

The term "isolated polynucleotide" is defined as a 
polynucleotide removed from the environment in which it 
naturally occurs. For example, a naturally-occurring DNA 
molecule present in the genome of a living bacteria or as part 

25 of a gene bank is not isolated, but the same molecule separated 
from the remaining part of the bacterial genome, as a result of, 
e.g., a cloning event (amplification), is isolated. Typically, 
an isolated DNA molecule is free from DNA regions (e.g., coding 
regions) with which it is immediately contiguous at the 5' or 3' 

30 end, in the naturally occurring genome. Such isolated 

polynucleotides may be part of a vector or a composition and 
still be defined as isolated in that such a vector or 
composition is not part of the natural environment of such 
polynucleotide . 
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The polynucleotide of the invention is either RNA or DNA 
(cDNA, genomic DNA, or synthetic DNA) , or modifications, 
variants, homologs or fragments thereof. The DNA is either 
double-stranded or single-stranded, and, if single-stranded, is 
5 either the coding strand or the non-coding (anti-sense) strand. 
Any one of the sequences that encode the polypeptides of the 
invention as shown in SEQ ID Nos : 1 to 26 is (a) a coding 
sequence, (b) a ribonucleotide sequence derived from 
transcription of (a) , or (c) a coding sequence which uses the 

10 redundancy or degeneracy of the genetic code to encode the same 
polypeptides. By "polypeptide" or "protein" is meant any chain 
of amino acids, regardless of length or post-translational 
modification (e.g., glycosylation or phosphorylation). Both 
terms are used interchangeably in the present application. 

15 Consistent with the first aspect of the invention, amino 

acid sequences are provided which are homologous to any one of 
SEQ ID Nos: 27 to 45. As used herein, "homologous amino acid 
sequence" is any polypeptide which is encoded, in whole or in 
part, by a nucleic acid sequence which hybridizes at 25-35°C 

20 below critical melting temperature (Tm) , to any portion of the 
nucleic acid sequences of SEQ ID Nos: 1 to 26. A homologous 
amino acid sequence is one that differs from an amino acid 
sequence shown in any one of SEQ ID Nos: 27 to 4 5 by one or 
more amino acid substitutions. Such a sequence also encompass 

25 serotypic variants (defined below) as well as sequences 
containing deletions or insertions which retain inherent 
characteristics of the polypeptide such as immunogenicity . 
Preferably, such a sequence is at least 75%, more preferably 
80%, and most preferably 90% identical to any one of SEQ ID 

30 Nos: 27 to 45. Homologous amino acid sequences include 

sequences that are identical or substantially identical to SEQ 
ID Nos: 27 to 45. By "amino acid sequence substantially 
identical" is meant a sequence that is at least 90%, preferably 
95%, more preferably 97%, and most preferably 99% identical to 
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an amino acid sequence of reference and that preferably differs 
from the sequence of reference by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions are substitutions 
5 among amino acids of the same class. These classes include, for 
example, amino acids having uncharged polar side chains, such as 
asparagine, glutamine, serine, threonine, and tyrosine; amino 
acids having basic side chains, such as lysine, arginine, and 
histidine; amino acids having acidic side chains, such as 
10 aspartic acid and glutamic acid; and amino acids having nonpolar 
side chains, such as glycine, alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine . 

Homology is measured using sequence analysis software 

15 such as Sequence Analysis Software Package of the Genetics 
Computer Group, University of Wisconsin Biotechnology Center, 
1710 University Avenue, Madison, WI 53705. Amino acid sequences 
are aligned to maximize identity. Gaps may be artificially 
introduced into the sequence to attain proper alignment. Once 

20 the optimal alignment has been set up, the degree of homology is 
established by recording all of the positions in which the amino 
acids of both sequences are identical, relative to the total 
number of positions. 

Homologous polynucleotide sequences are defined in a 

25 similar way. Preferably, a homologous sequence is one that is 
at least 45%, more preferably 60%, and most preferably 85% 
identical to any one of coding sequences SEQ ID Nos: 1 to 26. 

Consistent with the first aspect of the invention, 
polypeptides having a sequence homologous to any one of SEQ ID 

30 Nos: 27 to 45 include naturally-occurring allelic variants, as 
well as mutants or any other non-naturally occurring variants 
that retain the inherent characteristics of the polypeptide of 
SEQ ID Nos: 27 to 45. 
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As is known in the art, an allelic variant is an 
alternate form of a polypeptide that is characterized as having 
a substitution, deletion, or addition of one or more amino acids 
that does not alter the biological function of the polypeptide. 
5 By "biological function" is meant the function of the 

polypeptide in the cells in which it naturally occurs, even if 
the function is not necessary for the growth or survival of the 
cells. For example, the biological function of a porin is to 
allow the entry into cells of compounds present in the 

10 extracellular medium. Biological function is distinct from 
antigenic property. A polypeptide can have more than one 
biological function. 

Allelic variants are very common in nature. For example, 
a bacterial species such as C. pneumoniae , is usually 

15 represented by a variety of strains that differ from each other 
by minor allelic variations. Indeed, a polypeptide that 
fulfills the same biological function in different strains can 
have an amino acid sequence (and polynucleotide sequence) that 
are not identical in each of the strains. Despite this 

20 variation, an immune response directed generally against many 
allelic variants has been demonstrated. In studies of the 
Chlamydial MOMP antigen, cross-strain antibody binding plus 
neutralization of infectivity occurs despite amino acid sequence 
variation of MOMP from strain to strain, indicating that the 

25 MOMP, when used as an immunogen, is tolerant of amino acid 
variations . 

Polynucleotides encoding homologous polypeptides or 
allelic variants are retrieved by polymerase chain reaction 
(PCR) amplification of genomic bacterial DNA extracted by 
30 conventional methods. This involves the use of synthetic 

oligonucleotide primers matching upstream and downstream of the 
5' and 3' ends of the encoding domain. Suitable primers are 
designed according to the nucleotide sequence information 
provided in SEQ ID Nos : 1 to 26. The procedure is as follows: a 
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primer is selected which consists of 10 to 40, preferably 15 to 
25 nucleotides. It is advantageous to select primers containing 
C and G nucleotides in a proportion sufficient to ensure 
efficient hybridization; i.e., an amount of C and G nucleotides 
5 of at least 40%, preferably 50% of the total nucleotide content. 
An alternative method for retrieving polynucleotides 
encoding homologous polypeptides or allelic variants is by 
hybridization screening of a DNA or RNA library. Hybridization 
procedures are well-known in the art and are described in 

10 Ausubel et al., (Ref 41), Silhavy et al. (Ref 43), and Davis et 
al. (ref 44) . Important parameters for optimizing hybridization 
conditions are reflected in a formula used to obtain the 
critical melting temperature above which two complementary DNA 
strands separate from each other (Ref 45) . For polynucleotides 

15 of about 600 nucleotides or larger, this formula is as follows: 
Tm = 81.5 + 0.5 x (% G+C) +1.6 log (positive ion concentration) 
- 0.6 x (% formamide) . Under appropriate stringency conditions, 
hybridization temperature (Th) is approximately 20 to 40°C, 20 
to 25°C, or, preferably 30 to 40°C below the calculated Tm. 

20 Those skilled in the art will understand that optimal 

temperature and salt conditions can be readily determined. 

For the polynucleotides of the invention, stringent 
conditions are achieved for both pre-hybridizing and hybridizing 
incubations (i) within 4-16 hours at 42°C, in 6 x SSC containing 

25 50% formamide, or (ii) within 4-16 hours at 65°C in an aqueous 
6 x SSC solution (1 M NaCI, 0.1 M sodium citrate (pH 7.0)). 

Useful homologs and fragments thereof that do not occur 
naturally are designed using known methods for identifying 
regions of an antigen that are likely to tolerate amino acid 

30 sequence changes and/or deletions. As an example, homologous 
polypeptides from different species are compared; conserved 
sequences are identified. The more divergent sequences are the 
most likely to tolerate sequence changes. Alternatively, 
sequences are modified such that they become more reactive to T- 
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and/or B-cells. (See Table below for identification of T- and 
B- epitopes.) Yet another alternative is to mutate a particular 
amino acid residue or sequence within the polypeptide in vitro, 
then screen the mutant polypeptides for their ability to prevent 
5 or treat Chlamydia infection according to the method outlined 
below . 

A person skilled in the art will readily understand that 
by following the screening process of this invention, it will be 
determined without undue experimentation whether a particular 
10 homolog of any of SEQ ID Nos: 27 to 45 may be useful in the 

prevention or treatment of Chlamydia infection. The screening 
procedure comprises the steps: 

(i) immunizing an animal, preferably mouse, with the 

test homolog or fragment; 
15 (ii) inoculating the immunized animal with Chlamydia; 

and 

(iii) selecting those homologs or fragments which confer 

protection against Chlamydia. 
By "conferring protection" is meant that there is a 
20 reduction is severity of any of the effects of Chlamydia 

infection, in comparison with a control animal which was not 
immunized with the test homolog or fragment. 

It has been previously demonstrated (Yang et . aJ. , 1993) 
that mice are susceptible to intranasal infection with different 
25 isolates of C. pneumoniae. Strain AR-39 (Grayston, 1989) was 
used in Balb/c mice as a challenge infection model to examine 
the capacity of chlamydia gene products delivered as naked DNA 
to elicit a protective response against a sublethal C. 
pneumoniae lung infection. Protective immunity is defined as an 
30 accelerated clearance of pulmonary infection. 

Groups of 7 to 9 week old male Balb/c mice (6 to 10 per 
group) were immunized intramuscularly (i.m.) plus intranasally 
(i.n.) with plasmid DNA containing the coding sequence of a 
C. pneumoniae polypeptide. Saline or the plasmid vector lacking 
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an inserted chlamydial gene was given to groups of control 
animals . 

For i.m. immunization alternate left and right 
quadriceps were injected with lOOug of DNA in 50ul of PBS on 
5 three occasions at 0, 3 and 6 weeks. For i.n. immunization, 

anaesthetized mice aspirated 50ul of PBS containing 50 jag DNA on 
three occasions at 0, 3 and 6 weeks. At week 8, immunized mice 
were inoculated i.n. with 5 x 10 5 IFU of C. pneumoniae, strain 
AR39 in lOOul of SPG buffer to test their ability to limit the 

10 growth of a sublethal C. pneumoniae challenge. 

Lungs were taken from mice at day 9 post-challenge and 
immediately homogenised in SPG buffer (7.5% sucrose, 5mM 
glutamate, 12.5mM phosphate pH7.5). The homogenate was stored 
frozen at -70°C until assay. Dilutions of the homogenate were 

15 assayed for the presence of infectious chlamydia by inoculation 
onto monolayers of susceptible cells. The inoculum was 
centrifuged onto the cells at 3000rpm for 1 hour, then the cells 
were incubated for three days at 35°C in the presence of lfig/ml 
cycloheximide . After incubation the monolayers were fixed with 

20 formalin and methanol then immunoperoxidase stained for the 

presence of chlamydial inclusions using convalescent sera from 
rabbits infected with C .pneumoniae and metal-enhanced DAB as a 
peroxidase substrate. 

Consistent with the first aspect of the invention, 

25 polypeptide derivatives are provided that are partial sequences 
of SEQ ID Nos : 27 to 45, partial sequences of polypeptide 
sequences homologous to SEQ ID Nos: 27 to 45, polypeptides 
derived from full-length polypeptides by internal deletion, and 
fusion proteins. 

30 It is an accepted practice in the field of immunology to 

use fragments and variants of protein immunogens as vaccines, as 
all that is required to induce an immune response to a protein 
is a small (e.g., 8 to 10 amino acid) immunogenic region of the 
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protein. Various short synthetic peptides corresponding to 
surface-exposed antigens of pathogens other than Chlamydia have 
been shown to be effective vaccine antigens against their 
respective pathogens, e.g. an 11 residue peptide of murine 
5 mammary tumor virus (Ref 38), a 16-residue peptide of Semliki 
Forest- virus (Ref 39) , and two overlapping peptides of 15 
residues each from canine parvovirus (Ref 40). 

Accordingly, it will be readily apparent to one skilled 
in the art, having read the present description, that partial 

10 sequences of SEQ ID Nos : 27 to 45 or their homologous amino acid 
sequences are inherent to the full-length sequences and are 
taught by the present invention. Such polypeptide fragments 
preferably are at least 12 amino acids in length. 
Advantageously, they are at least 20 amino acids, preferably at 

15 least 50 amino acids, more preferably at least 75 amino acids, 
and most preferably at least 100 amino acids in length. 

Polynucleotides of 30 to 600 nucleotides encoding partial 
sequences of sequences homologous to SEQ ID Nos: 27 to 45 are 
retrieved by PCR amplification using the parameters outlined 

20 above and using primers matching the sequences upstream and 
downstream of the 5' and 3' ends of the fragment to be 
amplified. The template polynucleotide for such amplification 
is either the full length polynucleotide homologous to one of 
SEQ ID Nos: 1 to 26, or a polynucleotide contained in a mixture 

25 of polynucleotides such as a DNA or RNA library. As an 
alternative method for retrieving the partial sequences, 
screening hybridization is carried out under conditions 
described above and using the formula for calculating Tm. Where 
fragments of 30 to 600 nucleotides are to be retrieved, the 

30 calculated Tm is corrected by subtracting ( 600 /polynucleotide 

size in base pairs) and the stringency conditions are defined by 
a hybridization temperature that is 5 to 10°C below Tm. Where 
oligonucleotides shorter than 20-30 bases are to be obtained, 
the formula for calculating the Tm is as follows: Tm = 4 x (G+C) 
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+ 2 (A+T) . For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54 °C. Short peptides that are 
fragments of SEQ. ID Nos . 27 to 45 or their homologous 
sequences, are obtained directly by chemical synthesis (E. Gross 
5 and H. J. Meinhofer, 4 The Peptides: Analysis, Synthesis, 

Biology; Modern Techniques of Peptide Synthesis, John Wiley & 
Sons (1981), and M. Bodanzki, Principles of Peptide Synthesis, 
Springer -Verlag (1984)). 

Useful polypeptide derivatives, e.g., polypeptide 

10 fragments, are designed using computer-assisted analysis of 
amino acid sequences. This identifies probable surface- 
exposed, antigenic regions (Ref 37) . An analysis of the 13 
amino acid sequences contained in SEQ ID Nos: 27 to 45, based on 
the product of flexibility and hydrophobicity propensities using 

15 the program SEQSEE (Wishart DS, et al. w SEQSEE: a comprehensive 
program suite for protein sequence analysis." Comput Appl 
Biosci. 1994 Apr ; 10 (2) : 121-32) , reveal a number of potential B- 
and T-cell epitopes which may be used as a basis for selecting 
useful immunogenic fragments and variants. The results are 

20 shown in Figures 27 to 39. This analysis uses a reasonable 
combination of external surface features that is likely to be 
recognized by antibodies. Probable T-cell epitopes for HLA- 
A0201 MHC subclass were revealed by an algorithm written at 
Connaught Laboratories that emulates an approach developed at 

25 the NIH (Parker KC, et al. "Peptide binding to MHC class I 
molecules: implications for antigenic peptide prediction." 
Immunol Res 1995 ; 14 ( 1) : 34-57 ). 

Epitopes which induce a protective T cell-dependent 
immune response are present throughout the length of the 

30 polypeptide. However, some epitopes may be masked by secondary 
and tertiary structures of the polypeptide. To reveal such 
masked epitopes large internal deletions are created which 
remove much of the original protein structure and exposes the 
masked epitopes. Such internal deletions sometimes effects the 
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additional advantage of removing immunodominant regions of high 
variability among strains. Polynucleotides encoding polypeptide 
fragments and polypeptides having large internal deletions are 
constructed using standard methods (Ref 41). Such methods 
5 include standard PCR, inverse PCR, restriction enzyme treatment 
of cloned DNA molecules, or the method of Kunkel et al. (Ref 
42). Components for these methods and instructions for their 
use are readily available from various commercial sources such 
as Stratagene. Once the deletion mutants have been constructed, 

10 they are tested for their ability to prevent or treat Chlamydia 
infection as described above. 

As used herein, a fusion polypeptide is one that contains 
a polypeptide or a polypeptide derivative of the invention fused 
at the N- or C-terminal end to any other polypeptide 

15 (hereinafter referred to as a peptide tail) . A simple way to 
obtain such a fusion polypeptide is by translation of an in- 
frame fusion of the polynucleotide sequences, i.e., a hybrid 
gene. The hybrid gene encoding the fusion polypeptide is 
inserted into an expression vector which is used to transform or 

20 transfect a host cell. Alternatively, the polynucleotide 

sequence encoding the polypeptide or polypeptide derivative is 
inserted into an expression vector in which the polynucleotide 
encoding the peptide tail is already present. Such vectors and 
instructions for their use are commercially available, e.g. 

25 the pMal-c2 or pMal-p2 system from New England Biolabs, in 
which the peptide tail is a maltose binding protein, the 
glutathione-S-transf erase system of Pharmacia, or the His-Tag 
system available from Novagen. These and other expression 
systems provide convenient means for further purification of 

30 polypeptides and derivatives of the invention. 

An advantageous example of a fusion polypeptide is one 
where the polypeptide or homolog or fragment of the invention is 
fused to a polypeptide having adjuvant activity, such as subunit 
B of either cholera toxin or E. coll heat-labile toxin. Another 
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advantageous fusion is one where the polypeptide, homolog or 
fragment is fused to a strong T-cell epitope or B-cell epitope. 
Such an epitope may be one known in the art (e.g. the Hepatitis 
B virus core antigen, D.R. Millich et al . , "Antibody production 
5 to the nucleocapsid and envelope of the Hepatitis B virus primed 
by a single synthetic T cell site", Nature. 1987. 329:547-549), 
or one which has been identified in another polypeptide of the 
invention (Table ) . Consistent with this aspect of the 
invention is a fusion polypeptide comprising T- or B-cell 

10 epitopes from one of SEQ ID Nos : 27 to 45 or its homolog or 
fragment, wherein the epitopes are derived from multiple 
variants of said polypeptide or homolog or fragment, each 
variant differing from another in the location and sequence of 
its epitope within the polypeptide. Such a fusion is effective 

15 in the prevention and treatment of Chlamydia infection since it 
optimizes the T- and B-cell response to the overall polypeptide, 
homolog or fragment. 

To effect fusion, the polypeptide of the invention is 
fused to the N-, or preferably, to the C-terminal end of the 

20 polypeptide having adjuvant activity or T- or B-cell epitope. 
Alternatively, a polypeptide fragment of the invention is 
inserted internally within the amino acid sequence of the 
polypeptide having adjuvant activity. The T- or B-cell epitope 
may also be inserted internally within the amino acid sequence 

25 of the polypeptide of the invention. 

Consistent with the first aspect, the polynucleotides of 
the invention also encode hybrid precursor polypeptides 
containing heterologous signal peptides, which mature into 
polypeptides of the invention. By "heterologous signal peptide" 

30 is meant a signal peptide that is not found in naturally- 
occurring precursors of polypeptides of the invention. 

A polynucleotide molecule according to the invention, 
including RNA, DNA, or modifications or combinations thereof, 
have various applications. A DNA molecule is used, for example, 
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(i) in a process for producing the encoded polypeptide in a 
recombinant host system, (ii) in the construction of vaccine 
vectors such as poxviruses, which are further used in methods 
and compositions for preventing and/or treating Chlamydia 
5 infection, (iii) as a vaccine agent (as well as an RNA 

molecule) , in a naked form or formulated with a delivery vehicle 
and, (iv) in the construction of attenuated Chlamydia strains 
that can over-express a polynucleotide of the invention or 
express it in a non-toxic, mutated form. 

10 Accordingly, a second aspect of the invention encompasses 

(i) an expression cassette containing a DNA molecule of the 
invention placed under the control of the elements required for 
expression, in particular under the control of an appropriate 
promoter; (ii) an expression vector containing an expression 

15 cassette of the invention; (iii) a procaryotic or eucaryotic 
cell transformed or transfected with an expression cassette 
and/or vector of the invention, as well as (iv) a process for 
producing a polypeptide or polypeptide derivative encoded by a 
polynucleotide of the invention, which involves culturing a 

20 procaryotic or eucaryotic cell transformed or transfected with 
an expression cassette and/or vector of the invention, under 
conditions that allow expression of the DNA molecule of the 
invention and, recovering the encoded polypeptide or polypeptide 
derivative from the cell culture. 

25 A recombinant expression system is selected from 

procaryotic and eucaryotic hosts. Eucaryotic hosts include 
yeast ceils (e.g., Saccharomyces cerevlslae or Plchla pastorls) , 
mammalian cells (e.g., C0S1, NIH3T3, or JEG3 cells), arthropods 
cells (e.g., Spodoptera fruglperda ( SF9 ) cells), and plant 

30 cells. A preferred expression system is a procaryotic host such 
as E. coll. Bacterial and eucaryotic cells are available from a 
number of different sources including commercial sources to 
those skilled in the art, e.g., the American Type Culture 
Collection (ATCC; Rockville, Maryland) . Commercial sources of 
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cells used for recombinant protein expression also provide 
instructions for usage of the cells. 

The choice of the expression system depends on the 
features desired for the expressed polypeptide. For example, it 
5 may be useful to produce a polypeptide of the invention in a 
particular lipidated form or any other form. 

One skilled in the art would redily understand that not 
all vectors and expression control sequences and hosts would be 
expected to express equally well the polynucleotides of this 

10 invention. With the guidelines described below, however, a 

selection of vectors, expression control sequences and hosts may 
be made without undue experimentation and without departing from 
the scope of this invention. 

In selecting a vector, the host must be chosen that is 

15 compatible with the vector which is to exist and possibly 

replicate in it. Considerations are made with respect to the 
vector copy number, the ability to control the copy number, 
expression of other proteins such as antibiotic resistance. In 
selecting an expression control sequence, a number of variables 

20 are considered. Among the important variable are the relative 
strength of the sequence (e.g. the ability to drive expression 
under various conditions), the ability to control the sequence's 
function, compatibility between the polynucleotide to be 
expressed and the control sequence (e.g. secondary structures 

25 are considered to avoid hairpin structures which prevent 

efficient transcription) . In selecting the host, unicellular 
hosts are selected which are compatible with the selected 
vector, tolerant of any possible toxic effects of the expressed 
product, able to secrete the expressed product efficiently if 

30 such is desired, to be able to express the product in the 

desired conformation, to be easily scaled up, and to which ease 
of purification of the final product. 

The choice of the expression cassette depends on the 
host system selected as well as the features desired for the 
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expressed polypeptide. Typically, an expression cassette 
includes a promoter that is functional in the selected host 
system and can be constitutive or inducible; a ribosome binding 
site; a start codon (ATG) if necessary; a region encoding a 
5 signal peptide, e.g., a lipidation signal peptide; a DNA 
molecule of the invention; a stop codon; and optionally a 3' 
terminal region (translation and/or transcription terminator) . 
The signal peptide encoding region is adjacent to the 
polynucleotide of the invention and placed in proper reading 

10 frame. The signal peptide-encoding region is homologous or 

heterologous to the DNA molecule encoding the mature polypeptide 
and is compatible with the secretion apparatus of the host used 
for expression. The open reading frame constituted by the DNA 
molecule of the invention, solely or together with the signal 

15 peptide, is placed under the control of the promoter so that 
transcription and translation occur in the host system. 
Promoters and signal peptide encoding regions are widely known 
and available to those skilled in the art and include, for 
example, the promoter of Salmonella typhimurium (and 

20 derivatives) that is inducible by arabinose (promoter araB) and 
is functional in Gram-negative bacteria such as E. coll (as 
described in U.S. Patent No. 5,028,530 and in Cagnon et al. r 
(Ref 46)); the promoter of the gene of bacteriophage T7 encoding 
RNA polymerase, that is functional in a number of E. coli 

25 strains expressing T7 polymerase (described in U.S. Patent 
No. 4,952,496); OspA lipidation signal peptide ; and RlpB 
lipidation signal peptide (Ref 47). 

The expression cassette is typically part of an 
expression vector, which is selected for its ability to 

30 replicate in the chosen expression system. Expression vectors 
(e.g., plasmids or viral vectors) can be chosen, for example, 
from those described in Pouwels et al. (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987). Suitable expression 
vectors can be purchased from various commercial sources. 
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Methods for transf orming/transf ecting host cells with 
expression vectors are well-known in the art and depend on the 
host system selected as described in Ausubel et al. t (Ref 41). 
Upon expression, a recombinant polypeptide of the 
5 invention (or a polypeptide derivative) is produced and remains 
in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the periplasmic space, or is embedded 
in the cellular membrane. The polypeptide is recovered in a 
substantially purified form from the cell extract or from the 

10 supernatant after centrif ugation of the recombinant cell 

culture. Typically, the recombinant polypeptide is purified by 
antibody-based affinity purification or by other well-known 
methods that can be readily adapted by a person skilled in the 
art, such as fusion of the polynucleotide encoding the 

15 polypeptide or its derivative to a small affinity binding 

domain. Antibodies useful for purifying by immunoaf f inity the 
polypeptides of the invention are obtained as described below. 

A polynucleotide of the invention can also be useful as a 
vaccine. There are two major routes, either using a viral or 

20 bacterial host as gene delivery vehicle (live vaccine vector) or 
administering the gene in a free form, e.g., inserted into a 
plasmid. Therapeutic or prophylactic efficacy of a 
polynucleotide of the invention is evaluated as described below. 

Accordingly, a third aspect of the invention provides (i) 

25 a vaccine vector such as a poxvirus, containing a DNA molecule 
of the invention, placed under the control of elements required 
for expression; (ii) a composition of matter comprising a 
vaccine vector of the invention, together with a diluent or 
carrier; specifically (iii) a pharmaceutical composition 

30 containing a therapeutically or prophylact ically effective 

amount of a vaccine vector of the invention; (iv) a method for 
inducing an immune response against Chlamydia in a mammal (e.g., 
a human; alternatively, the method can be used in veterinary 
applications for treating or preventing Chlamydia infection of 



SUBSTITUTE SHEET (RULE 26) 



o e o a 4 3 4- iJ: .„ o •?• u o 

WO 00/24765 PCT/CA99/00992 

28 

animals, e.g., cats or birds), which involves administering to 
the mammal an immunogenically effective amount of a vaccine 
vector of the invention to elicit a protective or therapeutic 
immune response to Chlamydia ; and particularly, (v) a method 
5 for preventing and/or treating a Chlamydia (e.g., C. 

trachomatis , C. psittaci , C. pneumonia, C. pecorum) infection, 
which involves administering a prophylactic or therapeutic 
amount of a vaccine vector of the invention to an infected 
individual. Additionally, the third aspect of the invention 

10 encompasses the use of a vaccine vector of the invention in the 
preparation of a medicament for preventing and/or treating 
Chlamydia infection. 

As used herein, a vaccine vector expresses one or several 
polypeptides or derivatives of the invention, as well as at 

15 least one additional Chlamydia antigen (??), fragment, homolog, 
mutant, or derivative thereof. The vaccine vector may express 
additionally a cytokine, such as interleukin-2 (IL-2) or 
interleukin-12 (IL-12), that enhances the immune response 
(adjuvant effect) . It is understood that each of the components 

20 to be expressed is placed under the control of elements required 
for expression in a mammalian cell. 

Consistent with the third aspect of the invention is a 
composition comprising several vaccine vectors, each of them 
capable of expressing a polypeptide or derivative of the 

25 invention. A composition may also comprise a vaccine vector 
capable of expressing an additional Chlamydia antigen, or a 
subunit, fragment, homolog, mutant, or derivative thereof; or a 
cytokine such as IL-2 or IL-12. 

Vaccination methods for treating or preventing infection 

30 in a mammal comprises use of a vaccine vector of the invention 
to be administered by any conventional route , particularly to a 
mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, 
intestinal, rectal, vaginal, or urinary tract) surface or via 
the parenteral (e.g., subcutaneous, intradermal, intramuscular, 
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intravenous, or intraperitoneal) route. Preferred routes depend 
upon the choice of the vaccine vector. Treatment may be 
effected in a single dose or repeated at intervals. The 
appropriate dosage depends on various parameters understood by 
5 skilled artisans such as the vaccine vector itself, the route of 
administration or the condition of the mammal to be vaccinated 
{weight, age and the like) . 

Live vaccine vectors available in the art include viral 
vectors such as adenoviruses and poxviruses as well as bacterial 
10 vectors, e.g., Shigella, Salmonella, Vibrio cholerae, 

Lactobacillus, Bacille bilie de Calmette-Guerin (BCG) , and 
Streptococcus . 

An example of an adenovirus vector, as well as a method 
for constructing an adenovirus vector capable of expressing a 

15 DNA molecule of the invention, are described in U.S. Patent No. 
4,920,209. Poxvirus vectors include vaccinia and canary pox 
virus, described in U.S. Patent No. 4,722,848 and U.S. Patent 
No. 5,364,773, respectively. (Also see, e.g., Tartaglia et al., 
Virology (1992) 188:217) for a description of a vaccinia virus 

20 vector and Taylor et al, Vaccine (1995) 13:539 for a reference 
of a canary pox.) Poxvirus vectors capable of expressing a 
polynucleotide of the invention are obtained by homologous 
recombination as described in Kieny et al., Nature (1984) 
312:163 so that the polynucleotide of the invention is inserted 

25 in the viral genome under appropriate conditions for expression 
in mammalian cells. Generally, the dose of vaccine viral 
vector, for therapeutic or prophylactic use, can be of from 
about lxlO 4 to about lxlO 11 , advantageously from about IxlO 7 to 
about lxlO 10 , preferably of from about lxlO 7 to about lxlO 9 

30 plaque-forming units per kilogram. Preferably, viral vectors 
are administered parenterally ; for example, in 3 doses, 4 weeks 
apart. It is preferable to avoid adding a chemical adjuvant to 
a composition containing a viral vector of the invention and 
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thereby minimizing the immune response to the viral vector 
itself. 

Non-toxicogenic Vibrio choleras mutant strains that are 
useful as a live oral vaccine are known. Mekalanos et al., 
5 Nature (1983) 306:551 and U.S. Patent No. 4,882,278 describe 
strains which have a substantial amount of the coding sequence 
of each of the two ctxA alleles deleted so that no functional 
cholerae toxin is produced. WO 92/11354 describes a strain in 
which the irgA locus is inactivated by mutation; this mutation 

10 can be combined in a single strain with ctxA mutations. WO 

94/1533 describes a deletion mutant lacking functional ctxA and 
attRSl DNA sequences. These mutant strains are genetically 
engineered to express heterologous antigens, as described in 
WO 94/19482. An effective vaccine dose of a Vijbrio cholerae 

15 strain capable of expressing a polypeptide or polypeptide 

derivative encoded by a DNA molecule of the invention contains 
about 1x10 s to about lxlO 9 , preferably about lxlO 6 to about IxlO 8 , 
viable bacteria in a volume appropriate for the selected route 
of administration. Preferred routes of administration include 

20 all mucosal routes; most preferably, these vectors are 
administered intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically 
engineered for recombinant expression of heterologous antigens 
or not, and their use as oral vaccines are described in 

25 Nakayama et al. (Bio/Technology (1988) 6:693) and WO 92/11361. 
Preferred routes of administration include all mucosal routes; 
most preferably, these vectors are administered intranasally or 
orally. 

Other bacterial strains used as vaccine vectors in the 
30 context of the present invention are described in High et al. , 
EMBO (1992) 11:1991 and Sizemore et al., Science (1995) 270:299 
(Shigella flexneri) ; Medaglini et al., Proc. Natl. Acad. Sci. 
USA (1995) 92:6868 (Streptococcus gordonii) , Flynn J.L., Cell. 
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Mol. Biol. (1994) 40 (suppl. I):31, WO 88/6626, WO 90/0594, WO 
91/13157, WO 92/1796, and WO 92/21376 (Bacille Calmette Guerin) . 

In bacterial vectors, the polynucleotide of the invention 
is inserted into the bacterial genome or remains in a free 
5 state as part of a plasmid. 

The composition comprising a vaccine bacterial vector of 
the present invention may further contain an adjuvant . A 
number of adjuvants are known to those skilled in the art. 
Preferred adjuvants are selected from the list provided below. 

10 Accordingly, a fourth aspect of the invention provides 

(i) a composition of matter comprising a polynucleotide of the 
invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition comprising a therapeutically or 
prophylactically effective amount of a polynucleotide of the 

15 invention; (iii) a method for inducing an immune response 
against Chlamydia in a mammal by administration of an 
immunogenically effective amount of a polynucleotide of the 
invention to elicit a protective immune response to Chlamydia ; 
and particularly, (iv) a method for preventing and/or treating a 

20 Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae, or 
C. pecorum) infection, by administering a prophylactic or 
therapeutic amount of a polynucleotide of the invention to an 
infected individual. Additionally, the fourth aspect of the 
invention encompasses the use of a polynucleotide of the 

25 invention in the preparation of a medicament for preventing 

and/or treating Chlamydia infection. A preferred use includes 
the use of a DNA molecule placed under conditions for expression 
in a mammalian cell, especially in a plasmid that is unable to 
replicate in mammalian cells and to substantially integrate in a 

30 mammalian genome. 

Use of the polynucleotides of the invention include their 
administration to a mammal as a vaccine, for therapeutic or 
prophylactic purposes. Such polynucleotides are used in the 
form of DNA as part of a plasmid that is unable to replicate in 
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a mammalian cell and unable to integrate into the mammalian 
genome. Typically, such a DNA molecule is placed under the 
control of a promoter suitable for expression in a mammalian 
cell. The promoter functions either ubiquitously or tissue- 
5 specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter {described in U.S. 
Patent No. 4,168,062) and the Rous Sarcoma Virus promoter 
(described in Norton & Coffin, Molec. Cell Biol. (1985) 5:281). 
An example of a tissue-specific promoter is the desmin promoter 

10 which drives expression in muscle cells (Li et al. , Gene (1989) 
78:243, Li & Paulin, J. Biol. Chem. (1991) 266:6562 and Li & 
Paulin, J. Biol. Chem. (1993) 268:10403). Use of promoters is 
well-known to those skilled in the art. Useful vectors are 
described in numerous publications, specifically WO 94/21797 and 

15 Hartikka et al., Human Gene Therapy (1996) 7:1205. 

Polynucleotides of the invention which are used as a 
vaccine encode either a precursor or a mature form of the 
corresponding polypeptide. In the precursor form, the signal 
peptide is either homologous or heterologous. In the latter 

20 case, a eucaryotic leader sequence such as the leader sequence 
of the tissue-type plasminogen factor (tPA) is preferred. 

As used herein, a composition of the invention contains 
one or several polynucleotides with optionally at least one 
additional polynucleotide encoding another Chlamydia antigen 

25 such as urease subunit A, B, or both, or a fragment, derivative, 
mutant, or analog thereof. The composition may also contain an 
additional polynucleotide encoding a cytokine, such as 
interleukin-2 (IL-2) or interleukin-12 (IL-12) so that the 
immune response is enhanced. These additional polynucleotides 

30 are placed under appropriate control for expression. 

Advantageously, DNA molecules of the invention and/or additional 
DNA molecules to be included in the same composition, are 
present in the same plasmid. 
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Standard techniques of molecular biology for preparing 
and purifying polynucleotides are used in the preparation of 
polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention is formulated 
5 according to various methods outlined below. 

One method utililizes the polynucleotide in a naked 
form/ free of any delivery vehicles. Such a polynucleotide is 
simply diluted in a physiologically acceptable solution such as 
sterile saline or sterile buffered saline, with or without a 
10 carrier. When present, the carrier preferably is isotonic, 

hypotonic, or weakly hypertonic, and has a relatively low ionic 
strength, such as provided by a sucrose solution, e.g., a 
solution containing 20% sucrose. 

An alternative method utilizes the polynucleotide in 
15 association with agents that assist in cellular uptake. 

Examples of such agents are (i) chemicals that modify cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) 
liposomes for encapsulation of the polynucleotide, or 
(iii) cationic lipids or silica, gold, or tungsten 
20 microparticles which associate themselves with the 
polynucleotides . 

Anionic and neutral liposomes are well-known in the art 
(see, e.g., Liposomes: A Practical Approach, RPC New Ed, IRL 
press (1990), for a detailed description of methods for making 
25 liposomes) and are useful for delivering a large range of 

products, including polynucleotides. Cationic lipids are also 
known in the art and are commonly used for gene delivery. Such 
lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3- 
dioleyloxy) propyl] -N, N, N-trimethylammonium chloride), DOTAP 
30 (1, 2-bis (oleyloxy) -3- ( trimethylammonio) propane) , DDAB 
(dimethyldioctadecylammonium bromide) , DOGS 

(dioctadecylamidologlycyl spermine) and cholesterol derivatives 
such as DC-Choi (3 beta- (N- (N *, N ' -dimethyl aminomethane ) - 
carbamoyl) cholesterol) . A description of these cationic lipids 
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can be found in EP 187,702, WO 90/11092, U.S. Patent 
No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Patent 
No. 5,527,928. Cationic lipids for gene delivery are preferably 
used in association with a neutral lipid such as DOPE (dioleyl 
5 phosphatidylethanolamine) , as described in WO 90/11092 as an 
example . 

Formulation containing cationic liposomes may optionally 
contain other transf ection-f acilitating compounds. A number of 
them are described in WO 93/18759, WO 93/19768, WO 94/25608, and 

10 WO 95/2397. They include spermine derivatives useful for 

facilitating the transport of DNA through the nuclear membrane 
(see, for example, WO 93/18759) and membrane-permeabilizing 
compounds such as GALA, Gramicidine S, and cationic bile salts 
(see, for example, WO 93/19768) . 

15 Gold or tungsten microparticles are used for gene 

delivery, as described in WO 91/359, WO 93/17706, and Tang et 
al. (Nature (1992) 356:152). The micropart icle-coated 
polynucleotide is injected via intradermal or intraepidermal 
routes using a needleless injection device ("gene gun"), such as 

20 those described in U.S. Patent No. 4,945,050, U.S. Patent 
No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient 
depends, e.g., on the strength of the promoter used in the DNA 
construct, the immunogenicity of the expressed gene product, the 

25 condition of the mammal intended for administration (e.g., the 
weight, age, and general health of the mammal) , the mode of 
administration, and the type of formulation. In general, a 
therapeutically or prophylactically effective dose from about 1 
pg to about 1 mg, preferably, from about 10 pg to about 800 pg 

30 and, more preferably, from about 25 ug to about 250 pg, can be 
administered to human adults. The administration can be 
achieved in a single dose or repeated at intervals. 

The route of administration is any conventional route 
used in the vaccine field. As general guidance, a 
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polynucleotide of the invention is administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, 
intestinal, rectal, vaginal, and urinary tract surface; or via a 
parenteral route, e.g., by an intravenous, subcutaneous, 
5 intraperitoneal, intradermal, intraepidermal , or intramuscular 
route. The choice of administration route depends on the 
formulation that is selected. A polynucleotide formulated in 
association with bupivacaine is advantageously administered into 
muscles. When a neutral or anionic liposome or a cationic 

10 lipid, such as DOTMA or DC-Choi, is used, the formulation can be 
advantageously injected via intravenous, intranasal 
(aerosolization) , intramuscular, intradermal, and subcutaneous 
routes. A polynucleotide in a naked form can advantageously be 
administered via the intramuscular, intradermal, or sub- 

15 cutaneous routes. 

Although not absolutely required, such a composition can 
also contain an adjuvant. If so, a systemic adjuvant that does 
not require concomitant administration in order to exhibit an 
adjuvant effect is preferable such as, e.g., QS21, which is 

20 described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present 
application enables the design of specific nucleotide probes and 
primers that are used for diagnostic purposes . Accordingly, a 
fifth aspect of the invention provides a nucleotide probe or 

25 primer having a sequence found in or derived by degeneracy of 
the genetic code from a sequence shown in any one of SEQ ID 
Nos:l to 26. 

The term "probe" as used in the present application 
refers to DNA (preferably single stranded) or RNA molecules (or 
30 modifications or combinations thereof) that hybridize under the 
stringent conditions, as defined above, to nucleic acid 
molecules having SEQ ID Nos : 1 to 26 or to sequences homologous 
to SEQ ID Nos : 1 to 26, or to their complementary or anti-sense 
sequences. Generally, probes are significantly shorter than 
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full-length sequences . Such probes contain from about 5 to 
about 100, preferably from about 10 to about 80, nucleotides. 
In particular, probes have sequences that are at least 75%, 
preferably at least 85%, more preferably 95% homologous to a 
5 portion of any of SEQ ID Nos : 1 to 26 or that are complementary 
to such sequences. Probes may contain modified bases such as 
inosine, methyl-5-deoxycytidine, deoxyuridine, dimethylamino-5- 
deoxyuridine , or diamino-2, 6-purine. Sugar or phosphate 
residues may also be modified or substituted. For example, a 

10 deoxyribose residue may be replaced by a polyamide (Nielsen et 
al. , Science (1991) 254:1497) and phosphate residues may be 
replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate and phosphorothioate esters. In addition, the 
2 ' -hydroxyl group on ribonucleotides may be modified by 

15 including such groups as alkyl groups. 

Probes of the invention are used in diagnostic tests, as 
capture or detection probes. Such capture probes are 
conventionally immobilized on a solid support, directly or 
indirectly, by covalent means or by passive adsorption. A 

20 detection probe is labelled by a detection marker selected from: 
radioactive isotopes, enzymes such as peroxidase, alkaline 
phosphatase, and enzymes able to hydrolyze a chromogenic, 
fluorogenic, or luminescent substrate, compounds that are 
chromogenic, fluorogenic, or luminescent, nucleotide base 

25 analogs, and biotin. 

Probes of the invention are used in any conventional 
hybridization technique, such as dot blot (Maniatis et al., 
Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York), Southern blot 

30 (Southern, J. Mol. Biol. (1975) 98:503), northern blot 

(identical to Southern blot with the exception that RNA is used 
as a target), or the sandwich technique (Dunn et al., Cell 
(1977) 12:23). The latter technique involves the use of a 
specific capture probe and/or a specific detection probe with 
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nucleotide sequences that at least partially differ from each 
other . 

A primer is a probe of usually about 10 to about 
40 nucleotides that is used to initiate enzymatic polymerization 
5 of DNA in an amplification process (e.g., PCR) , in an elongation 
process, or in a reverse transcription method. Primers used in 
diagnostic methods involving PCR are labeled by methods known in 
the art. 

As described herein, the invention also encompasses (i) a 

10 reagent comprising a probe of the invention for detecting and/or 
identifying the presence of Chlamydia in a biological material; 
(ii) a method for detecting and/or identifying the presence of 
Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA or 

15 RNA is extracted from the material and denatured, and (c) 

exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization 
conditions, such that hybridization is detected; and (iii) a 
method for detecting and/or identifying the presence of 

20 Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA is 
extracted therefrom, (c) the extracted DNA is primed with at 
least one, and preferably two, primers of the invention and 
amplified by polymerase chain reaction, and (d) the amplified 

25 DNA fragment is produced. 

It is apparent that disclosure of polynucleotide 
sequences of SEQ ID Nos : 1 to 26, their homolog, and partial 
sequences of either enable their corresponding amino acid 
sequences. Accordingly, a sixth aspect of the invention 

30 features a substantially purified polypeptide or polypeptide 
derivative having an amino acid sequence encoded by a 
polynucleotide of the invention. 

A "substantially purified polypeptide" as used herein is 
defined as a polypeptide that is separated from the environment 
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in which it naturally occurs and/or that is free of the majority 
of the polypeptides that are present in the environment in which 
it was synthesized. For example, a substantially purified 
polypeptide is free from cytoplasmic polypeptides. Those 
5 skilled in the art would readily understand that the 

polypeptides of the invention may be purified from a natural 
source, i.e., a Chlamydia strain, or produced by recombinant 
means . 

Consistent with the sixth aspect of the invention are 

10 polypeptides, homologs or fragments which are modified or 

treated to enhance their immunogenicity in the target animal, in 
whom the polypeptide, homolog or fragments are intended to 
confer protection against Chlamydia. Such modifications or 
treatments include: amino acid substitutions with an amino acid 

15 derivative such as 3-methyhistidine , 4-hydroxyproline, 5- 

hydroxylysine etc., modifications or deletions which are carried 
out after preparation of the polypeptide, homolog or fragment, 
such as the modification of free amino, carboxyl or hydroxyl 
side groups of the amino acids. 

20 Identification of homologous polypeptides or polypeptide 

derivatives encoded by polynucleotides of the invention which 
have specific antigenicity is achieved by screening for cross- 
reactivity with an antiserum raised against the polypeptide of 
reference having an amino acid sequence of any one of SEQ ID 

25 Nos: 27 to 45. The procedure is as follows: a monospecific 
hyperimmune antiserum is raised against a purified reference 
polypeptide, a fusion polypeptide (for example, an expression 
product of MBP, GST, or His-tag systems), or a synthetic peptide 
predicted to be antigenic. Where an antiserum is raised 

30 against a fusion polypeptide, two different fusion systems are 
employed. Specific antigenicity can be determined according to 
a number of methods, including Western blot (Towbin et al., 
Proc. Natl. Acad. Sci. USA (1979) 76:4350), dot blot, and ELISA, 
as described below. 
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In a Western blot assay, the product to be screened, 
either as a purified preparation or a total E. coll extract, is 
submitted to SDS-Page electrophoresis as described by Laemmli 
(Nature (1970) 227:680). After transfer to a nitrocellulose 
5 membrane, the material is further incubated with the 

monospecific hyperimmune antiserum diluted in the range of 
dilutions from about 1 : 5 to about 1: 5000, preferably from about 
1:100 to about 1:500. Specific antigenicity is shown once a 
band corresponding to the product exhibits reactivity at any of 

10 the dilutions in the above range. 

In an ELISA assay, the product to be screened is 
preferably used as the coating antigen. A purified preparation 
is preferred, although a whole cell extract can also be used. 
Briefly, about 100 ul of a preparation at about 10 pg protein/ml 

15 are distributed into wells of a 96-well polycarbonate ELISA 
plate. The plate is incubated for 2 hours at 37 °C then 
overnight at 4'C. The plate is washed with phosphate buffer 
saline (PBS) containing 0.05% Tween 20 (PBS/Tween buffer). The 
wells are saturated with 250 ul PBS containing 1% bovine serum 

20 albumin (BSA) to prevent non-specific antibody binding. After 1 
hour incubation at 37 °C, the plate is washed with PBS/Tween 
buffer. The antiserum is serially diluted in PBS/Tween buffer 
containing 0.5% BSA. 100 ul of dilutions are added per well. 
The plate is incubated for 90 minutes at 37 °C, washed and 

25 evaluated according to standard procedures. For example, a goat 
anti-rabbit peroxidase conjugate is added to the wells when 
specific antibodies were raised in rabbits. Incubation is 
carried out for 90 minutes at 37 °C and the plate is washed. The 
reaction is developed with the appropriate substrate and the 

30 reaction is measured by colorimetry (absorbance measured 
spectrophotometrically) . Under the above experimental 
conditions, a positive reaction is shown by O.D. values greater 
than a non immune control serum. 
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In a dot blot assay, a purified product is preferred, 
although a whole cell extract can also be used. Briefly, a 
solution of the product at about 100 pg/ml is serially two-fold 
diluted in 50 mM Tris-HCl (pH 7.5). 100 pi of each dilution are 
5 applied to a nitrocellulose membrane 0.45 um set in a 96-well 
dot blot apparatus (Biorad) . The buffer is removed by applying 
vacuum to the system. Wells are washed by addition of 50 mM 
Tris-HCl (pH 7.5) and the membrane is air-dried. The membrane 
is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5) 0.15 M 

10 NaCl, 10 g/L skim milk) and incubated with an antiserum dilution 
from about 1:50 to about 1:5000, preferably about 1:500. The 
reaction is revealed according to standard procedures. For 
example, a goat anti-rabbit peroxidase conjugate is added to the 
wells when rabbit antibodies are used. Incubation is carried 

15 out 90 minutes at 37 °C and the blot is washed. The reaction is 
developed with the appropriate substrate and stopped. The 
reaction is measured visually by the appearance of a colored 
spot, e.g., by colorimetry. Under the above experimental 
conditions, a positive reaction is shown once a colored spot is 

20 associated with a dilution of at least about 1:5, preferably of 
at least about 1:500. 

Therapeutic or prophylactic efficacy of a polypeptide or 
derivative of the invention can be evaluated as described below. 
A seventh aspect of the invention provides (i) a composition of 

25 matter comprising a polypeptide of the invention together with a 
diluent or carrier; specifically (ii) a pharmaceutical 
composition containing a therapeutically or prophylactically 
effective amount of a polypeptide of the invention; (iii) a 
method for inducing an immune response against Chlamydia in a 

30 mammal, by administering to the mammal an immunogenically 

effective amount of a polypeptide of the invention to elicit a 
protective immune response to Chlamydia; and particularly, (iv) 
a method for preventing and/or treating a Chlamydia (e.g., C. 
trachomatis . C. psittaci , C. pneumoniae, or C. pecorum) 
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infection, by administering a prophylactic or therapeutic amount 
of a polypeptide of the invention to an infected individual. 
Additionally, the seventh aspect of the invention encompasses 
the use of a polypeptide of the invention in the preparation of 
5 a medicament for preventing and/or treating Chlamydia infection. 
As used herein, the immunogenic compositions of the 
invention are administered by conventional routes known the 
vaccine field, in particular to a mucosal (e.g., ocular, 
intranasal, pulmonary, oral, gastric, intestinal, rectal, 

10 vaginal, or urinary tract) surface or via the parenteral (e.g., 
subcutaneous, intradermal, intramuscular, intravenous, or 
intraperitoneal) route. The choice of administration route 
depends upon a number of parameters, such as the adjuvant 
associated with the polypeptide. If a mucosal adjuvant is used, 

15 the intranasal or oral route is preferred. If a lipid 

formulation or an aluminum compound is used, the parenteral 
route is preferred with the sub-cutaneous or intramuscular route 
being most preferred. The choice also depends upon the nature 
of the vaccine agent. For example, a polypeptide of the 

20 invention fused to CTB or LTB is best administered to a mucosal 
surface . 

As used herein, the composition of the invention contains 
one or several polypeptides or derivatives of the invention. 
The composition optionally contains at least one additional 

25 Chlamydia antigen, or a subunit, fragment, homolog, mutant, or 
derivative thereof. 

For use in a composition of the invention, a polypeptide 
or derivative thereof is formulated into or with liposomes, 
preferably neutral or anionic liposomes, microspheres, ISCOMS, 

30 or virus-like-particles (VLPs) to facilitate delivery and/or 
enhance the immune response. These compounds are readily 
available to one skilled in the art; for example, see Liposomes: 
A Practical Approach (supra) . 
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Adjuvants other than liposomes and the like are also used 
and- are known in the art. Adjuvants may protect the antigen 
from rapid dispersal by sequestering it in a local deposit, or 
they may contain substances that stimulate the host to secrete 
5 factors that are chemotactic for macrophages and other 

components of the immune system. An appropriate selection can 
conventionally be made by those skilled in the art, for example, 
from those described below. 

Treatment is achieved in a single dose or repeated as 

10 necessary at intervals, as can be determined readily by one 

skilled in the art. For example, a priming dose is followed by 
three booster doses at weekly or monthly intervals. An 
appropriate dose depends on various parameters including the 
recipient {e.g., adult or infant), the particular vaccine 

15 antigen, the route and frequency of administration, the 

presence/absence or type of adjuvant, and the desired effect 
(e.g., protection and/or treatment), as can be determined by one 
skilled in the art. In general, a vaccine antigen of the 
invention is administered by a mucosal route in an amount from 

20 about 10 ug to about 500 mg, preferably from about 1 mg to about 
200 mg. For the parenteral route of administration, the dose 
usually does not exceed about 1 mg, preferably about 100 ug . 

When used as vaccine agents, polynucleotides and 
polypeptides of the invention may be used sequentially as part 

25 of a multistep immunization process. For example, a mammal is 
initially primed with a vaccine vector of the invention such as 
a pox virus, e.g., via the parenteral route, and then boosted 
twice with the polypeptide encoded by the vaccine vector, e.g., 
via the mucosal route. In another example, liposomes associated 

30 with a polypeptide or derivative of the invention is also used 
for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in 
combination with a mucosal adjuvant (e.g., LT) . 
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A polypeptide derivative of the invention is also used in 
accordance with the seventh aspect as a diagnostic reagent for 
detecting the presence of ant ±- Chlamydia antibodies, e.g., in a 
blood sample. Such polypeptides are about 5 to about 80, 
5 preferably about 10 to about 50 amino acids in length. They are 
either labeled or unlabeled, depending upon the diagnostic 
method. Diagnostic methods involving such a reagent are 
described below. 

Upon expression of a DNA molecule of the invention, a 

10 polypeptide or polypeptide derivative is produced and purified 
using known laboratory techniques. As described above, the 
polypeptide or polypeptide derivative may be produced as a 
fusion protein containing a fused tail that facilitates 
purification. The fusion product is used to immunize a small 

15 mammal, e.g., a mouse or a rabbit, in order to raise antibodies 
against the polypeptide or polypeptide derivative (monospecific 
antibodies) . Accordingly, an eighth aspect of the invention 
provides a monospecific antibody that binds to a polypeptide or 
polypeptide derivative of the invention. 

20 By "monospecific antibody" is meant an antibody that is 

capable of reacting with a unique naturally-occurring Chlamydia 
polypeptide. An antibody of the invention is either polyclonal 
or monoclonal. Monospecific antibodies may be recombinant, 
e.g., chimeric (e.g., constituted by a variable region of murine 

25 origin associated with a human constant region) , humanized (a 
human immunoglobulin constant backbone together with 
hypervariable region of animal, e.g., murine, origin), and/or 
single chain. Both polyclonal and monospecific antibodies may 
also be in the form of immunoglobulin fragments, e.g., F(ab) '2 

30 or Fab fragments. The antibodies of the invention are of any 
isotype, e.g., IgG or IgA, and polyclonal antibodies are of a 
single isotype or a mixture of isotypes . 

Antibodies against the polypeptides, homologs or 
fragments of the present invention are generated by immunization 
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of a mammal with a composition comprising said polypeptide, 
homolog or fragment. Scu antibodies may be polyclonal or 
monoclonal. Methods to produce polyclonal or monoclonal 
antibodies are well known in the art. For a review, see 
5 "Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, 
Eds. E. Harlow and D. Lane (1988), and D.E. Yelton et al . , 1981. 
Ann. Rev. Biochem. 50:657-680. For monoclonal antibodies, see 
Kohl and Milstein?... 

The antibodies of the invention, which are raised to a 

10 polypeptide or polypeptide derivative of the invention, are 
produced and identified using standard immunological assays, 
e.g., Western blot analysis, dot blot assay, or ELISA (see, 
e.g., Coligan et al. , Current Protocols in Immunology (1994) 
John Wiley & Sons, Inc., New York, NY) . The antibodies are used 

15 in diagnostic methods to detect the presence of a Chlamydia 
antigen in a sample, such as a biological sample. The 
antibodies are also used in affinity chromatography for 
purifying a polypeptide or polypeptide derivative of the 
invention. As is discussed further below, such antibodies may 

20 be used in prophylactic and therapeutic passive immunization 
methods . 

Accordingly, a ninth aspect of the invention provides 
(i) a reagent for detecting the presence of Chlamydia in a 
biological sample that contains an antibody, polypeptide, or 

25 polypeptide derivative of the invention; and (ii) a diagnostic 
method for detecting the presence of Chlamydia in a biological 
sample, by contacting the biological sample with an antibody, a 
polypeptide, or a polypeptide derivative of the invention, such 
that an immune complex is formed, and by detecting such complex 

30 to indicate the presence of Chlamydia in the sample or the 
organism from which the sample is derived. 

Those skilled in the art will readily understand that the 
immune complex is formed between a component of the sample and 
the antibody, polypeptide, or polypeptide derivative, whichever 
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is used, and that any unbound material is removed prior to 
detecting the complex. It is understood that a polypeptide 
reagent is useful for detecting the presence of anti -Chlamydia 
antibodies in a sample, e.g., a blood sample, while an antibody 
5 of the invention is used for screening a sample, such as a 
gastric extract or biopsy, for the presence of Chlamydia 
polypeptides . 

For diagnostic applications, the reagent (i.e., the 
antibody, polypeptide, or polypeptide derivative of the 

10 invention) is either in a free state or immobilized on a solid 
support, such as a tube, a bead, or any other conventional 
support used in the field. Immobilization is achieved using 
direct or indirect means. Direct means include passive 
adsorption (non-covalent binding) or covalent binding between 

15 the support and the reagent. By "indirect means" is meant that 
an anti-reagent compound that interacts with a reagent is first 
attached to the solid support. For example, if a polypeptide 
reagent is used, an antibody that binds to it can serve as an 
anti-reagent, provided that it binds to an epitope that is not 

20 involved in the recognition of antibodies in biological samples. 
Indirect means may also employ a ligand-receptor system, for 
example, where a molecule such as a vitamin is grafted onto the 
polypeptide reagent and the corresponding receptor immobilized 
on the solid phase. This is illustrated by the biotin- 

25 streptavidin system. Alternatively, a peptide tail is added 
chemically or by genetic engineering to the reagent and the 
grafted or fused product immobilized by passive adsorption or 
covalent linkage of the peptide tail. 

Such diagnostic agents may be included in a kit which 

30 also comprises instructions for use. The reagent are labeled 
with a detection means which allows for the detection of the 
reagent when it is bound to its target. The detection means may 
be a fluorescent agent such as fluorescein isocyanate or 
fluorescein isothiocyanate, or an enzyme such as horse radish 
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peroxidase or luciferase or alkaline phosphatase, or a 
radioactive element such as 125 I or 51 Cr. 

Accordingly, a tenth aspect of the invention provides a 
process for purifying, from a biological sample, a polypeptide 
5 or polypeptide derivative of the invention, which involves 
carrying out antibody-based affinity chromatography with the 
biological sample, wherein the antibody is a monospecific 
antibody of the invention. 

For use in a purification process of the invention, the 

10 antibody is either polyclonal or monospecific, and preferably is 
of the IgG type. Purified IgGs is prepared from an antiserum 
using standard methods (see, e.g., Coligan et al., supra). 
Conventional chromatography supports, as well as standard 
methods for grafting antibodies, are described in, e.g., 

15 Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. (1988) 
and outlined below. 

Briefly, a biological sample, such as an C. pneumoniae 
extract preferably in a buffer solution, is applied to a 
chromatography material, preferably equilibrated with the buffer 

20 used to dilute the biological sample so that the polypeptide or 
polypeptide derivative of the invention (i.e., the antigen) is 
allowed to adsorb onto the material. The chromatography 
material, such as a gel or a resin coupled to an antibody of the 
invention, is in either a batch form or a column. The unbound 

25 components are washed off and the antigen is then eluted with an 
appropriate elution buffer, such as a glycine buffer or a buffer 
containing a chaotropic agent, e.g., guanidine HC1, or high salt 
concentration (e.g., 3 M MgCl 2 ). Eluted fractions are recovered 
and the presence of the antigen is detected, e.g., by measuring 

30 the absorbance at 280 nm. 

An eleventh aspect of the invention provides (i) a 
composition of matter comprising a monospecific antibody of the 
invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition comprising a therapeutically or 
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prophylactically effective amount of a monospecific antibody of 
the invention, and (iii) a method for treating or preventing a 
Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae or 
C. pecorum) infection, by administering a therapeutic or 
5 prophylactic amount of a monospecific antibody of the invention 
to an infected individual. Additionally, the eleventh aspect of 
the invention encompasses the use of a monospecific antibody of 
the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

10 The monospecific antibody is either polyclonal or 

monoclonal, preferably of the IgA isotype (predominantly) . In 
passive immunization, the antibody is administered to a mucosal 
surface of a mammal, e.g., the gastric mucosa, e.g., orally or 
intragastrically, advantageously, in the presence of a 

15 bicarbonate buffer. Alternatively, systemic administration, not 
requiring a bicarbonate buffer, is carried out. A monospecific 
antibody of the invention is administered as a single active 
component or as a mixture with at least one monospecific 
antibody specific for a different Chlamydia polypeptide. The 

20 amount of antibody and the particular regimen used are readily 
determined by one skilled in the art. For example, daily 
administration of about 100 to 1,000 mg of antibodies over one 
week, or three doses per day of about 100 to 1,000 mg of 
antibodies over two or three days, are effective regimens for 

25 most purposes. 

Therapeutic or prophylactic efficacy are evaluated using 
standard methods in the art, e.g., by measuring induction of a 
mucosal immune response or induction of protective and/or 
therapeutic immunity, using, e.g., the C. pneumoniae mouse 

30 model. Those skilled in the art will readily recognize that the 
C. pneumoniae strain of the model may be replaced with another 
Chlamydia strain. For example, the efficacy of DNA molecules 
and polypeptides from C. pneumoniae is preferably evaluated in a 
mouse model using C. pneumoniae strain. Protection is 
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determined by comparing the degree of Chlamydia infection to 
that of a control group. Protection is shown when infection is 
reduced by comparison to the control group. Such an evaluation 
is made for polynucleotides, vaccine vectors, polypeptides and 
5 derivatives thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions 
described above are as follows. 

Adjuvants for parenteral administration include aluminum 
compounds, such as aluminum hydroxide, aluminum phosphate, and 

10 aluminum hydroxy phosphate. The antigen is precipitated with, 
or adsorbed onto, the aluminum compound according to standard 
protocols. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, 
MT) , is used in parenteral administration. 

Adjuvants for mucosal administration include bacterial 

15 toxins, e.g., the cholera toxin (CT) , the E. coli heat-labile 
toxin (LT) , the Clostridium difficile toxin A and the pertussis 
toxin (PT) , or combinations, subunits, toxoids, or mutants 
thereof such as a purified preparation of native cholera toxin 
subunit B (CTB) . Fragments, homologs, derivatives, and fusions 

20 to any of these toxins are also suitable, provided that they 
retain adjuvant activity. Preferably, a mutant having reduced 
toxicity is used. Suitable mutants are described, e.g., in WO 
95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT 
mutant), and WO 95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). 

25 Additional LT mutants that are used in the methods and 

compositions of the invention include, e.g., Ser-63-Lys, Ala-69- 
Gly, Glu-110-Asp, and Glu-112-Asp mutants. Other adjuvants, 
such as a bacterial monophosphoryl lipid A (MPLA) of, e.g., E. 
coli, Salmonella minnesota , Salmonella typhimurium, or Shigella 

30 flexneri ; saponins, or polylactide glycolide (PLGA) 

microspheres, is also be used in mucosal administration. 

Adjuvants useful for both mucosal and parenteral 
administrations include polyphosphazene (WO 95/2415) , DC-chol (3 
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b- (N- (N • , N ' -dimethyl aminomethane ) -carbamoyl ) cholesterol; U.S. 
Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 88/9336). 

Any pharmaceutical composition of the invention 
containing a polynucleotide, a polypeptide, a polypeptide 
5 derivative, or an antibody of the invention, is manufactured in 
a conventional manner. In particular, it is formulated with a 
pharmaceutical^ acceptable diluent or carrier, e.g., water or a 
saline solution such as phosphate buffer saline. In general, a 
diluent or carrier is selected on the basis of the mode and 

10 route of administration, and standard pharmaceutical practice. 
Suitable pharmaceutical carriers or diluents, as well as 
pharmaceutical necessities for their use in pharmaceutical 
formulations, are described in .Remington's Pharmaceutical 
Sciences, a standard reference text in this field and in the 

15 USP/NF. 

The invention also includes methods in which Chlamydia 
infection are treated by oral administration of a Chlamydia 
polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a 

20 combination thereof. Examples of such compounds that can be 
administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g., macrolides, tetracyclines, and 
derivatives thereof (specific examples of antibiotics that can 
be used include azithromycin or doxicyclin or immunomodulators 

25 such as cytokines or steroids) . In addition, compounds 

containing more than one of the above-listed components coupled 
together, are used. The invention also includes compositions 
for carrying out these methods, i.e., compositions containing a 
Chlamydia antigen (or antigens) of the invention, an adjuvant, 

30 and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods 
and compositions of the invention are readily determined by one 
skilled in the art. Treatment /immunization schedules are also 
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known and readily designed by one skilled in the art. For 
example, the non-vaccine components can be administered on days 
1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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CLAIMS : 



1. A nucleic acid molecule comprising a nucleic acid 
sequence which encodes a polypeptide selected from any of: 

(a) SEQ ID Nos: 27 to 45; 

5 (b) an immunogenic fragment comprising at least 12 

consecutive amino acids from a polypeptide of (a) ; and 

(c) a polypeptide of (a) or (b) which has been 
modified without loss of immunogenicity , wherein said modified 
polypeptide is at least 75% identical in amino acid sequence to 
10 the corresponding polypeptide of (a) or (b) . 

\ 

2. A nucleic acid molecule comprising a nucleic acid 
sequence selected from any of: 

(a) SEQ ID Nos: 1 to 26; 

(b) a sequence which encodes a polypeptide encoded by 
15 any one of SEQ ID Nos: 1 to 26; 

(c) a sequence comprising at least 38 consecutive 
nucleotides from any one of the nucleic acid sequences of (a) 
and (b) ; and > 

(d) a sequence which encodes a polypeptide which is 
20 at least 75% identical in amino acid sequence to any one of the 

polypeptides encoded by SEQ ID Nos: 1 to 26. 

3. A nucleic acid molecule comprising a nucleic acid 
sequence which is anti-sense to the nucleic acid molecule of 
claim 1. 

25 4. A nucleic acid molecule comprising a nucleic acid 

sequence which encodes a fusion protein, said fusion protein 
comprising a polypeptide encoded by a nucleic acid molecule 
according to claim 1 and a second polypeptide. 



5. The nucleic acid molecule of claim 4 wherein the 
second polypeptide is a heterologous signal peptide. 

6. The nucleic acid molecule of claim 4 wherein the 
second polypeptide has adjuvant activity. 

7. A nucleic acid molecule according to claim 1, 
operatively linked to one or more expression control sequences 

8. A vaccine comprising a vaccine vector and at least 
one first nucleic acid selected from any of: 

(i) SEQ ID Nos: 1 to 26; 

(ii) a nucleic acid sequence which encodes a 
polypeptide encoded by any one of SEQ ID Nos: 1 to 26; 

(iii) a nucleic acid sequence comprising at least 38 
consecutive nucleotides from any one of the nucleic acid 
sequences of (i) and (ii); 

(iv) a nucleic acid sequence which encodes a 
polypeptide which is at least 75% identical in amino acid 
sequence to the polypeptide encoded by any one of SEQ ID Nos: 
to 26; 

(v) a nucleic acid sequence which encodes a 
polypeptide whose sequence is set forth in any one of SEQ ID 
Nos: 27 to 45; 

(vi) a nucleic acid sequence which encodes an 
immunogenic fragment comprising at least 12 consecutive amino 
acids from any one of SEQ ID Nos: 27 to 45; and 

(vii) a nucleic acid sequence which encodes a 
polypeptide as defined in (v) or an immunogenic fragment as 
defined in (vi) which has been modified without loss of 
immunogenicity , wherein said modified polypeptide or fragment 
is at least 75% identical in amino acid sequence to the 



corresponding polypeptide of (v) or the corresponding fragment 
of (vi) ; 

wherein each first nucleic acid is capable of being 

expressed . 

9. A vaccine comprising a vaccine vector and at least 

one first nucleic acid encoding a fusion protein, wherein the 
fusion protein comprises: 

(a) a first polypeptide selected from any of: 

(i) a polypeptide encoded by any one of SEQ ID Nos : 1 

to 2 6; 

(ii) a polypeptide encoded by a nucleic acid sequence 
comprising at least 38 consecutive nucleotides from any one of 
SEQ ID Nos: 1 to 26; 

(iii) a polypeptide which is at least 75% identical 
in amino acid sequence to the polypeptide encoded by any one of 
SEQ ID Nos: 1 to 26; 

(iv) a polypeptide whose sequence is set forth in any 
one of SEQ ID Nos: 27 to 45; 

(v) an immunogenic fragment comprising at least 12 
consecutive amino acids from any one of SEQ ID Nos: 27 to 45; 
and 

(vi) a polypeptide as defined (iv) or an immunogenic 
fragment as defined in (v) which has been modified without loss 
of iramunogenicity, wherein said modified polypeptide or 
fragment is at least 75% identical in amino acid sequence to 
the corresponding polypeptide of (iv) or the corresponding 
fragment of (v) ; and 



(b) a second polypeptide; 
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wherein each first nucleic acid is capable of being 

expressed . 

10. The vaccine of claim 9 wherein the second polypeptide 

is a heterologous signal peptide. 

5 11. The vaccine of claim 9 wherein the second polypeptide 

has adjuvant activity. 

12. The vaccine of claim 8 wherein each first nucleic 

acid is operatively linked to one or more expression control 
sequences . 

10 13. A vaccine according to claim 8 wherein each first 

nucleic acid is expressed as a polypeptide, and wherein the 
vaccine comprises a second nucleic acid encoding an additional 
polypeptide which enhances the immune response to the 
polypeptide expressed by the first nucleic acid. 

15 14. The vaccine of claim 13 wherein the second nucleic 

acid encodes an additional Chlamydia polypeptide. 

15. A pharmaceutical composition comprising a nucleic 

acid according to claim 1 and a pharmaceutically acceptable 
carrier . 

20 16. A pharmaceutical composition comprising a vaccine 

according to claim 8 and a pharmaceutically acceptable carrier. 

IV. A unicellular host transformed with the nucleic acid 

molecule of claim 7 . 

18. An isolated nucleic acid probe of 5 to 100 

25 nucleotides which hybridizes under stringent conditions to any 
one of nucleic acid molecules of SEQ ID Nos : 1 to 26, or to a 
complementary or anti-sense sequence of said nucleic acid 
molecule . 
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19. 

under 



A primer of 10 to 40 nucleotides which 
stringent conditions to any one of nucleic 



hybridizes 
acid molecules 
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of SEQ ID Nos: 1 to 26, or to a homolog or complementary or 
anti-sense sequence of said nucleic acid molecule. 

20. A polypeptide encoded by a nucleic acid sequence 

according to claim 2 . 

5 21. A polypeptide comprising an amino acid sequence 

selected from any of: 

(a) SEQ ID Nos: 27 to 45; 

(b) an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a) ; and 

10 (c) a polypeptide of (a) or (b) which has been 

modified without loss of immunogenicity , wherein said modified 
polypeptide is at least 75% identical in amino acid sequence to 
the corresponding polypeptide of (a) or (b) . 

22. A fusion protein comprising a polypeptide of claim 20 
15 and a second polypeptide. 

23. The fusion protein of claim 22 wherein the second 
polypeptide is a heterologous signal peptide. 

24. The fusion protein of claim 22 wherein the second 
polypeptide has adjuvant activity. 

20 25. A method for producing a polypeptide of claim 20, 

comprising the step of culturing a unicellular host of 
claim 17 . 

26. An antibody against the polypeptide of claim 20. 

27 . A vaccine comprising at least one first polypeptide 
25 selected from any of: 

(i) a polypeptide encoded by any one of SEQ ID Nos: 1 

to 26; 



(ii) a polypeptide encoded by a nucleic acid sequence 
comprising at least 38 consecutive nucleotides from any one of 
SEQ ID Nos: 1 to 26; 

(iii) a polypeptide which is at least 75% identical 
in amino acid sequence to the polypeptide encoded by any one of 
SEQ ID Nos: 1 to 26; 

(iv) a polypeptide whose sequence is set forth in any 
one of SEQ ID Nos: 27 to 45; 

(v) an immunogenic fragment comprising at least 12 
consecutive amino acids from any one of SEQ ID Nos: 27 to 45; 
and 

(vi) a polypeptide as defined in (iv) or an 
immunogenic fragment as defined in (v) which has been modified 
without loss of immunogenicity , wherein said modified 
polypeptide or fragment is at least 75% identical in amino acid 
sequence to the corresponding polypeptide of (iv) or the 
corresponding fragment of (v) . 

28. A vaccine comprising at least one fusion protein, 

wherein the fusion protein comprises: 

(a) a first polypeptide selected from any of: 

(i) a polypeptide encoded by any one of SEQ ID Nos: 1 

to 26; 

(ii) a polypeptide encoded by a nucleic acid sequence 
comprising at least 38 consecutive nucleotides from any one of 
SEQ ID Nos: 1 to 26; 

(iii) a polypeptide which is at least 75% identical 
in amino acid sequence to the polypeptide encoded by any one of 
SEQ ID Nos: 1 to 26; 

(iv) a polypeptide whose sequence is set forth in any 
one of SEQ ID Nos: 27 to 45; 
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(v) an immunogenic fragment comprising at least 12 
consecutive amino acids from any one of SEQ ID Nos : 27 to 45; 
and 

(vi) a polypeptide as defined (iv) or an immunogenic 
5 fragment as defined in (v) which has been modified without loss 

of immunogenicity , wherein said modified polypeptide or 
fragment is at least 75% identical in amino acid sequence to 
the corresponding polypeptide of (iv) or the corresponding 
fragment of (v) ; and 

10 (b) a second polypeptide. 

29. The vaccine of claim 28 wherein the second 
polypeptide is a heterologous signal peptide. 

30. The vaccine of claim 28 wherein the second 
polypeptide has adjuvant activity. 

15 31. A vaccine comprising at least one first polypeptide 

according to claim 20 and an additional polypeptide which 
enhances the immune response to the first polypeptide. 

32. The vaccine of claim 31 wherein the additional 

polypeptide comprises a Chlamydia polypeptide. 

20 33. A pharmaceutical composition comprising a polypeptide 

according to claim 20 and a pharmaceutical ly acceptable 
carrier . 

34 . A pharmaceutical composition comprising a vaccine 

according to claim 27 and a pharmaceut ically acceptable 
25 carrier. 



35. 

according 
carrier . 



A pharmaceutical composition comprising an antibody 
to claim 26 and a pharmaceut ically acceptable 



36. 

infection 
amount of 



A method for preventing or treating Chlamydia 
comprising administering to a patient an effective 



(a) a nucleic acid molecule according to claim 2; 

(b) a vaccine comprising a vaccine vector and at 
least one first nucleic acid according to claim 2 ; 

(c) a pharmaceutical composition comprising a nucleic 
acid according to claim 2 and a pharmaceut ically acceptable 
carrier; 

(d) a polypeptide encoded by a nucleic acid sequence 
according to claim 2; or 

(e) an antibody against a polypeptide encoded by a 
nucleic acid sequence according to claim 2. 

37 . A method of detecting Chlamydia infection comprising 
the step of contacting a body fluid of a mammal to be tested, 
with a component selected from any one of: 

(a) a nucleic acid molecule according to claim 2; 

(b) a polypeptide encoded by a nucleic acid sequence 
according to claim 2; and 

(c) an antibody against a polypeptide encoded by a 
nucleic acid sequence according to claim 2. 

38. A diagnostic kit comprising instructions for use and 
a component selected from any one of: 

(a) a nucleic acid molecule according to claim 2; 

(b) a polypeptide encoded by a nucleic acid sequence according 
to claim 2; and 

(c) an antibody against a polypeptide encoded by a nucleic acid 
sequence according to claim 2 . 



39. A method for identifying a polypeptide of claim 20 

which induces an immune response effective to prevent or lessen 
the severity of Chlamydia infection in a mammal previously 
immunized with polypeptide, comprising the steps of: 

(a) immunizing a mouse with a polypeptide of claim 

2 0 ; and 

(b) inoculating the immunized mouse with Chlamydia/ 

wherein the polypeptide which prevents or lessens the 
severity of Chlamydia infection in the immunized mouse compared 
to a non-immunized control mouse is identified. 
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CLAIMS 

1. A nucleic acid molecule comprising a nucleic acid 
sequence which encodes a polypeptide selected from any of: 

5 (a) SEQ ID Nos : 27 to 45; 

(b) an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a) ; and 

(c) a polypeptide of (a) or (b) which has been modified to 
improve its immunogenicity , wherein said modified 

10 polypeptide is at least 75% identical in amino acid 

sequence to the corresponding polypeptide of (a) or 
(b) . 

2. A nucleic acid molecule comprising a nucleic acid 
15 sequence selected from any of: 

(a) SEQ ID Nos: 1 to 26; 

(b) a sequence which encodes a polypeptide encoded by any 
one of SEQ ID Nos: 1 to 26; 

(c) a sequence comprising at least 38 consecutive 

20 nucleotides from any one of the nucleic acid sequences 

of (a) and (b) ; and 

(d) a sequence which encodes a polypeptide which is at 
least 75% identical in amino acid sequence to any one 
of the polypeptides encoded by SEQ ID Nos: 1 to 26. 
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A nucleic acid molecule comprising a nucleic acid 
sequence which encodes a fusion protein, said fusion 
protein comprising a polypeptide encoded by a nucleic acid 
molecule according to claim 1 and an additional 
polypeptide . 

A nucleic acid molecule according to claim 1, 
operatively linked to one or more expression control 
sequences . 

A vaccine comprising at least one first nucleic acid 
according to any one of claims 1 to 4 and a vaccine vector 
wherein each first nucleic acid is expressed as a 
polypeptide, the vaccine optionally comprising a second 
nucleic acid encoding an additional polypeptide which 
enhances the immune response to the polypeptide expressed 
by said first nucleic acid. 

The vaccine of claim 5 wherein the second nucleic acid 
encodes an additional Chlamydia polypeptide. 

A pharmaceutical composition comprising a nucleic acid 
according to any one of claims 1 to 5 and a 
pharmaceutically acceptable carrier. 
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8. A pharmaceutical composition comprising a vaccine 

according to claim 5 or 6 and a pharmaceutically acceptable 
carrier . 



5 9. A unicellular host transformed with the nucleic acid 

molecule of claim 4 . 



10. A nucleic acid probe of 5 to 100 nucleotides which 

hybridizes under stringent conditions to any one of nucleic 
10 acid molecules of SEQ ID Nos : 1 to 26, or to a homolog or 

complementary or anti-sense sequence of said nucleic acid 
molecule . 



11. A primer of 10 to 40 nucleotides which hybridizes 

15 under stringent conditions to any one of nucleic acid 

molecules of SEQ ID Nos: 1 to 26, or to a homolog or 
complementary or anti-sense sequence of said nucleic acid 
molecule . 



20 12. A polypeptide encoded by a nucleic acid sequence 

according to any one of claims 1 to 4 . 



13. A polypeptide comprising an amino acid sequence 

selected from any of: 
25 (a) SEQ ID Nos: 27 to 45; 
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(b) an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a); and 

(c) a polypeptide of (a) or <b) which has been modified to 
improve its immunogenicity, wherein said modified 

5 polypeptide is at least 75% identical in amino acid 

sequence to the corresponding polypeptide of (a) or 
(b) . 



14. A fusion polypeptide comprising a polypeptide of claim 

10 12 or 13 and an additional polypeptide. 



15. A method for producing a polypeptide of claim 12 or 
13, comprising the step of culturing a unicellular host 
according to claim 9. 

15 

16. An antibody against the polypeptide of any one of 
claims 12 to 14. 



17. A vaccine comprising at least one first polypeptide 

20 according to any one of claims 12 to 14 and a 

pharmaceutically acceptable carrier, optionally comprising 
a second polypeptide which enhances the immune response to 
the first polypeptide. 



25 18. The vaccine of claim 17 wherein the second polypeptide 

comprises an additional Chlamydia polypeptide. 
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A pharmaceutical composition comprising a polypeptide 
according to any one of claims 12 to 14 and a 
pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a vaccine 
according to claim 17 or 18 and a pharmaceutically 
acceptable carrier. 

A pharmaceutical composition comprising an antibody 
according to claim 16 and a pharmaceutically acceptable 
carrier . 

A method for preventing or treating Chlamydia 
infection using: 

(a) the nucleic acid of any one of claims 1 to 4; 

(b) the vaccine of any one of claims 5, 6, 17 and 18; 

(c) the pharmaceutical composition of any one of claims 7, 
8, 19 to 21; 

(d) the polypeptide of any one of claims 12 to 14; or 

(e) the antibody of claim 16. 

A method of detecting Chlamydia infection comprising 
the step of assaying a body fluid of a mammal to be tested, 
with a component selected from any one of: 
(a) the nucleic acid of any one of claims 1 to 4; 
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(b) the polypeptide of any one of claims 12 to 14; and 

(c) the antibody of claim 16. 



24. A diagnostic kit comprising instructions for use and 

component selected from any one of: 

(a) the nucleic acid of any one of claims 1 to 4; 

(b) the polypeptide of any one of claims 12 to 14; and 
the antibody of claim 16. 
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4), CPN100422 (SEQ ID Nos: 4 and 6), CPN100424 (SEQ ID Nos: 7 and 8), CPN100426 (SEQ ID Nos: 9 and 10), CPN100508 
O < SE Q ID Nos: 11 and 12 >- CPN100515 (SEQ ID Nos: 13 and 14), CPN100538 (SEQ ID Nos: 15 and 16), CPN100557 (SEQ ID 

Nos: 17 and 18), CPN100622 (SEQ ID Nos: 19 and 20), CPN100626 (SEQ ID Nos: 21 and 22), CPN100628 (SEQ ID Nos: 23 and 
^ 24) and CPN 100630 (SEQ ID Nos: 25 and 26). 
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AND USES THEREOF 
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Figure 1: CPN1003 97 

attttaacgt gcgtatcatt tgtgactaag agatagactt gctttcttta tctatcttct 60 

gtattggaaa gaaagcccct tgagggaaaa aaaggttgtt atg aag att cca etc 115 

Met Lys lie Pro Leu 

1 5 / 

cgc ttt tta ttg ata tea tta gta cct acg ctt tct atg teg aat tta 163 
Arq Phe Leu Leu lie Ser Leu Val Pro Thr Leu Ser Met Ser Asn Leu 
10 15 20 

tta gga get get act ace gaa gag tta teg get age aat age ttc gat 211 
Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala Ser Asn Ser Phe Asp 
25 30 35 

gga act aca tea aca aca age ttt tct agt aaa aca tea teg get aca 259 
Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys Thr Ser Ser Ala Thr 
40 45 50 

gat ggc acc aat tat gtt ttt aaa gat tct gta gtt ata gaa aat gta 307 
Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val Val lie Glu Asn Val 
55 60 65 

ccc aaa aca ggg gaa act cag tct act agt tgt ttt aaa aat gac get 355 
Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys Phe Lys Asn Asp Ala 
70 75 80 85 

gca get gga gat eta aat ttc tta gga ggg gga ttt tct ttc aca ttt 403 
Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly Phe Ser Phe Thr Phe 
90 95 100 

age aat ate gat gca acc acg get tct gga get get att gga agt gaa 451 
Ser Asn He Asp Ala Thr Thr Ala Ser Gly Ala Ala He Gly Ser Glu 
105 110 H5 

gca get aat aag aca gtc acg tta tea gga ttt teg gca ctt tct ttt 499 
Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe Ser Ala Leu Ser Phe 
120 125 130 

ctt aaa tec cca gca agt aca gtg act aat gga ttg gga get ate aat 547 
Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly Leu Gly Ala He Asn 
135 140 145 

gtt aaa ggg aat tta age eta ttg gat aat gat aag gta ttg att cag 595 
Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp Lys Val Leu lie Gin 
150 155 160 165 

gac aat ttc tea aca gga gat ggc gga gca att aat tgt gca ggc tec 643 
Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala He Asn Cys Ala Gly Ser 
170 175 180 

ttg aag ate gca aac aat aag tec ctt tct ttt att gga aat agt tct 691 
Leu Lys He Ala Asn Asn Lys Ser Leu Ser Phe He Gly Asn Ser Ser 
185 190 195 
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Fig. 1 (con't) 

tea aca cgt ggc gga gcg att cat acc aaa aac etc aca eta tct tct 739 
Ser Thr Arg Gly Gly Ala lie His Thr Lys Asn Leu Thr Leu Ser Ser 
200 205 210 

ggt ggg gaa act eta ttt cag ggg aat aca gcg cct acg get get ggt 787 
Gly Gly Glu Thr Leu Phe Gin Gly Asn Thr Ala Pro Thr Ala Ala Gly 
215 220 225 

aaa gga ggt get ate gcg att gca gac tct ggc acc eta tec att tct 835 
Lys Gly Gly Ala He Ala He Ala Asp Ser Gly Thr Leu Ser He Ser 
230 235 240 245 

gga gac agt ggc gac att ate ttt gaa ggc aat acg ata gga get aca 883 
Gly Asp Ser Gly Asp He He Phe Glu Gly Asn Thr He Gly Ala Thr 
250 255 260 

gga acc gtc tct cat agt get att gat tta gga act age get aag ata 931 
Gly Thr Val Ser His Ser Ala He Asp Leu Gly Thr Ser Ala Lys He 
265 270 275 

act gcg tta cgt get gcg caa gga cat acg ata tac ttt tat gat ccg 979 
Thr Ala Leu Arg Ala Ala Gin Gly His Thr He Tyr Phe Tyr Asp Pro 
280 285 290 

att act gta aca gga teg aca tct gtt get gat get etc aat att aat 1027 
He Thr Val Thr Gly Ser Thr Ser Val Ala Asp Ala Leu Asn He Asn 
295 300 305 

age cct gat act gga gat aac aaa gag tat acg gga acc ata gtc ttt 1075 
Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr Gly Thr He Val Phe 
310 315 320 325 

tct gga gag aag etc acg gag gca gaa get aaa gat gag aag aac cgc 1123 
Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys Asp Glu Lys Asn Arg 
330 335 340 

act tct aaa tta ctt caa aat gtt get ttt aaa aat ggg act gta gtt 1171 
Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys Asn Gly Thr Val Val 
345 350 355 

tta aaa ggt gat gtc gtt tta agt gcg aac ggt ttc tct cag gat gca 1219 
Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly Phe Ser Gin Asp Ala 
360 365 370 

aac tct aag ttg att atg gat tta ggg acg teg ttg gtt gca aac acc 1267 
Asn Ser Lys Leu He Met Asp Leu Gly Thr Ser Leu Val Ala Asn Thr 
375 380 385 

gaa agt ate gag tta acg aat ttg gaa att aat ata gac tct etc agg 1315 
Glu Ser He Glu Leu Thr Asn Leu Glu He Asn He Asp Ser Leu Arg 
390 395 400 405 
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Fig. 1 (con't) 

aac ggg aaa aag ata aaa etc agt get gec aca get cag aaa gat att 1363 
Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr Ala Gin Lys Asp He 
410 415 420 

cgt ata gat cgt cct gtt gta ctg gca att age gat gag agt ttt tat 1411 
Arg He Asp Arg Pro Val Val Leu Ala He Ser Asp Glu Ser Phe Tyr 
425 430 435 

caa aat ggc ttt ttg aat gag gac cat tec tat gat ggg att ctt gag 1459 
Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr Asp Gly He Leu Glu 
440 445 450 

tta gat get ggg aaa gac ate gtg att tct gca gat tct cgc agt ata 1507 
Leu Asp Ala Gly Lys Asp He Val He Ser Ala Asp Ser Arg Ser He 
455 460 465 

gat get gta caa tct ccg tat ggc tat cag gga aag tgg acg ate aat 1555 
Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly Lys Trp Thr He Asn 
470 475 480 485 

tgg tct act gat gat aag aaa get acg gtt tct tgg gcg aag cag agt 1603 
Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser Trp Ala Lys Gin Ser 
490 495 500 

ttt aat ccc act get gag cag gag get ccg tta gtt cct aat ctt ctt 1651 
Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu Val Pro Asn Leu Leu 
505 510 515 

tgg ggt tct ttt ata gat gtt cgt tec ttc cag aat ttt ata gag eta 1699 
Trp Gly Ser Phe He Asp Val Arg Ser Phe Gin Asn Phe He Glu Leu 
520 525 530 

ggt act gaa ggt get cct tac gaa aag aga ttt tgg gtt gca ggc att 1747 
Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe Trp Val Ala Gly He 
535 540 545 

tec aat gtt ttg cat agg age ggt cgt gaa aat caa agg aaa ttc cgt 1795 
Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn Gin Arg Lys Phe Arg 
550 555 5fi0 565 

cat gtg agt gga ggt get gta gta ggt get age acg agg atg ccg ggt 1843 
His Val Ser Gly Gly Ala Val Val Gly Ala Ser Thr Arg Met Pro Gly 
570 575 580 

ggt gat acc ttg tct ctg ggt ttt get cag etc ttt gcg cgt gac aaa 1891 
Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu Phe Ala Arg Asp Lys 
585 590 595 

gac tac ttt atg aat acc aat ttc gca aag acc tac gca gga tct tta 1939 
Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr Tyr Ala Gly Ser Leu 
600 605 610 
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cgt ttg cag cac gat get tec eta tac tct gtg gtg agt ate ctt tta 1987 

Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val Val Ser lie Leu Leu 

615 620 625 

gga gag gga gga etc cgc gag ate ctg ttg cct tat gtt tec aag act 2035 

Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro Tyr Val Ser Lys Thr 

630 635 640 645 

ctg ccg tgc tct ttc tat ggg cag ctt age tac ggc cat acg gat cat 2083 

Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr Gly His Thr Asp His 

650 655 660 

cgc atg aag ace gag tct eta ccc ccc ccc ccc ccg acg etc teg acg 2131 

Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro Pro Thr Leu Ser Thr 

665 670 675 

gat cat act tct tgg gga gga tat gtc tgg get gga gag ctg gga act 2179 

Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala Gly Glu Leu Gly Thr 

680 685 690 

cga gtt get gtt gaa aat ace age ggc aga gga ttt ttc caa gag tac 2227 

Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly Phe Phe Gin Glu Tyr 

695 700 705 

act cca ttt gta aaa gtc caa get gtt tac get cgc caa gat age ttt 2275 

Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ala Arg Gin Asp Ser Phe 

710 715 720 725 

gta gaa eta gga get ate agt cgt gat ttt agt gat teg cat ctt tat 2323 

Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser Asp Ser His Leu Tyr 

730 735 740 

aac ctt gcg att cct ctt gga ate aag tta gag aaa egg ttt gca gag 2371 

Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu Lys Arg Phe Ala Glu 

745 750 755 

caa tat tat cat gtt gta gcg atg tat tct cca gat gtt tgt cgt agt 2419 

Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro Asp Val Cys Arg Ser 

760 765 770 

aac ccc aaa tgt acg act acc eta ctt tec aac caa ggg agt tgg aag 2467 

Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn Gin Gly Ser Trp Lys 

775 780 785 

acc aaa ggt teg aac tta gca aga cag get ggt att gtt cag gee tea 2515 

Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly lie Val Gin Ala Ser 

790 795 800 805 

ggt ttt cga tct ttg gga get gca gca gag ctt ttc ggg aac ttt ggc 2563 

Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu Phe Gly Asn Phe Gly 

810 815 820 
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ttt gaa tgg egg gga tct tct cgt age tat aat gta gat gcg ggt age 2611 

Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn Val Asp Ala Gly Ser 

825 830 835 

aaa ate aaa ttt tagegattte tctttcgatg ctatttttcc atggctattt 2663 
Lys lie Lys Phe 
840 

ttaaaatgat agccatggtt atagatacgt agtccttatt tcaaagaaga cactgttgca 2723 
ttagataege tctctgatcc ctcaaaa 2750 
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Figure 2 (RY-32) 

Restriction Enzyme analysis of CPN 100397 

Maell Mae I I I Bael 

Msel I Tsp45I Ddel | MboII 

II III I 

ATTTTAACGTGCGTATCATTTGTGACTAAGAGATAGACTTGCTTTCTTTATCTATCTTCT 

x + + + + + + 60 

TAAAATTGCACGCATAGTAAACACTGATTCTCTATCTGAACGAAAGAAATAGATAGAAGA 

BslI 
Smll | 

CviJI | | Hinfl Acil 

Mill I | Bce83I Tfil MboII 

II! I I > 

GTATTGG AAAG AAAG C CC C TTG AGGGAAAAAAAGGTTG TTATGAAG ATT C C ACT C CG CTT 

61 + + + + + + 120 

CATAACCTTTCTTTCGGGGAACTCCCTTTTTTTCCAACAATACTTCTAAGGTGAGGCGAA 

Apol 

Bbvl Fnu4HI 

Tsp509I Alul | 

Acelll | CviJI j 

EcoRV Rsal TaqI | | Tsel j Earl 

I I Ml II I 

TTTATTGATATCATTAGTACCTACGCTTTCTATGTCGAATTTATTAGGAGCTGCTACTAC 

12 i + + + + + + 180 

AAATAACTATAGTAATCATGGATGCGAAAGATACAGCTTAAATAATCCTCGACGATGATG 



Cac8I 
Bfal 
CviJI | 
MboII | 
Bcgl Nhel j 
I I I 



TaqI Alul 

Alul | CviJI 

CviJI j Bed Bcgl Hindlll | Bfal 

III I II' 



CGAAGAG TT ATCGG C TAG C AATAG CTT CG ATGG AACTAC AT CAAC AAC AAG CTTTTCTAG 

18 1 + + + + + + 240 

GCTTCTCAATAGCCGATCGTTATCGAAGCTACCTTGATGTAGTTGTTGTTCGAAAAGATC 

Tsp509I 

NlalV | Hinfl 

Sfcl Banl | | Dral | Sf cl 

CviJI | Bed | j | Msel | Tfil | 

II III I II I I 

TAAAACATCATCGGCTACAGATGGCACCAATTATGTTTTTAAAGATTCTGTAGTTATAGA 

241 + + + + + + 300 

ATTTTGTAGTAGCCGATGTCTACCGTGGTTAATACAAAAATTTCTAAGACATCAATATCT 
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BslI 
BslI | 
II 



Dral 
Msel 
Bbvl 
BseMII 
Bfal | 
Spel | | 
Ddel Accl j | | 
III I 



Aim 

CviJI 
Hgal 
MspAlI 
PvuII 
PstI 
Fnu4HI 
CviRI 
Tsel 
Fnu4HI | 
Sf cl j 
Tsel | j 
II I 



AAATGTAC C CAAAAC AGGGGAAACTC AGTCTACTAGTTGTTTTAAAAATGACG CTG CAG C 
TTTACATGGGTTTTGTCC CCTTTG AGT CAGATG ATCAAC AAAATTTTTACTG CGACGTCG 



Apol 
Tsp509I 
Bbvl 
Dpnl 



Bglll | 

BstYI j 

Sau3AI j 

I I 



Ddel 

|MnlI | Bpml SfaNI 

.... I I I I 

TGGAGATCTAAATTTCTTAGGAGGGGGATTTTCTTTCACATTTAGCAATATCGATGCAAC 

361 + + + + + + 420 

ACCTCTAGATTTAAAGAATCCTCCCCCTAAAAGAAAGTGTAAATCGTTATAGCTACGTTG 



Acelll 
BsaJI 
BstDSI 
CviRI 
Cjel | 
Clal | | 
Taql | | 
I I I 



Beef I 
Fnu4HI 
Alul | 
CviJI j 
Tsel j 
Mwol 
Hpyl78III | 
CviJI j j 

Mwol | j j 

Bbvl || i| 
III 



Alul 
CviJI 
Fnu4HI | 
Tsel | j 
Bpml | j | 
Cjel | | | | 
III 



Hpyl78III 

Maell | 

Maelll | j 

Taal j j 

Tsp45I j j 

Bbvl | | j 
I 



421 



CACGGCTTCTGGAGCTGCTATTGGAAGTGAAGCAGCTAATAAGACAGTCACGTTATCAGG 
GTGCCGAAGACCTCGACGATAACCTTCACTTCGTCGATTATTCTGTCAGTGCAATAGTCC 



Maelll 
Taal 
Tsp45I 

Rsal | Alul 
Msel TatI | |TspRI CviJI 

I I I I I I 

ATTTTCGGCACTTTCTTTTCTTAAATCCCCAGCAAGTACAGTGACTAATGGATTGGGAGC 

481 + + + + + + 540 

TAAAAGCCGTGAAAGAAAAGAATTTAGGGGTCGTTCATGTCACTGATTACCTAACCCTCG 
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Apol 
Tsp509I CviJI 
Msel |MseI j 

I I 



Hin4I 
Cjel 
Tsp509I 
Hpyl78III | 
Hinfl | j 

Tfil | | 

I I 



TATCAATGTTAAAGGGAATTTAAGCCTATTGGATAATGATAAGGTATTGATTCAGGACAA 

541 + + + + + + 600 

ATAGTTACAATTTCCCTTAAATTCGGATAACCTATTACTATTCCATAACTAAGTCCTGTT 



Tsp509I 
Msel | 
Vspl | 



NlalV 
| CviJI | 

| CacSI 
j CviRI | | | 
I Cjel | ! ■■ 
i III 

TTTCTCAACAGGAGATGGCGGAGCAATTAATTGTGCAGGCTCCTTGAAGATCGCAAACAA 

601 + + + + + + 660 

AAAGAGTTGTCCTCTACCGCCTCGTTAATTAACACGTCCGAGGAACTTCTAGCGTTTGTT 



Tsp509I 
Ecil | 
Acil| | 
Bed | | | 
I II I 



Dpnl 
BsmFI | MboII 
Sau3AI | |BsgI | 
Ml I I 



BsaAI 
Pmll 
Maell | 
Afllll j j Ecil Hinfl 
Bsbl j |AciI | Tfil 
Mill I 

TAAGTCCCTTTCTTTTATTGGAAATAGTTCTTCAACACGTGGCGGAGCGATTCATACCAA 

66 i + + + + + + 720 

ATTCAGGGAAAGAAAATAACCTTTATCAAGAAGTTGTGCACCGCCTCGCTAAGTATGGTT 



Tthlllll 

I 



XmnI 
MboII | 
I I 



Fnu4HI 
CviJI 
Tsel 
Mwol | 
Bbvl Haell | | 

MboII Mnll Cjel Hhal | j j 

II I II I I 

AAACCTCACACTATCTTCTGGTGGGGAAACTCTATTTCAGGGGAATACAGCGCCTACGGC 

721 + + + + + + 780 

TTTGGAGTGTGATAGAAGACCACCCCTTTGAGATAAAGTCCCCTTATGTCGCGGATGCCG 
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NlalV 
BanI 
AlwNI 
Hinf I 
CviRI 



Plel 
Hpyl78III 
Nrul 

Hin4I Thai 
Mnll Bcefl |CjeI j 

! II! 



Hpyl78III 
BsmAI | 



TGCTGGTAAAGGAGGTGCTATCGCGATTGCAGACTCTGGCACCCTATCCATTTCTGGAGA 

781 + h- + + + + 840 

ACGACCATTTCCTCCACGATAGCGCTAACGTCTGAGACCGTGGGATAGGTAAAGACCTCT 



TspRI 
Taal | 
I I 



Bpml 
I 



Sf cl 
Alul | 
Cvi JI j 
I I 



Ppil 
Taal ) BsmAI 
NlalV | jBsmBI 

I I ! 



CAGTGGCGACATTATCTTTGAAGGCAATACGATAGGAGCTACAGGAACCGTCTCTCATAG 
GTCACCGCTGTAATAGAAACTTCCGTTATGCTATCCTCGATGTCCTTGGCAGAGAGTATC 



Hhal 
Fspl 
FHU4HI 
Tsel 

Ddel BsaAI 

Haell Mwol 

Hhal 1 Mae I I | 

Eco47III | | Maelll | j 

Bfal | | j Bbvl | j j 

I III II II 

TGCTATTGATTTAGGAACTAGCGCTAAGATAACTGCGTTACGTGCTGCGCAAGGACATAC 

901 + + + + + + 960 

ACGATAACTAAATCCTTGATCGCGATTCTATTGACGCAATGCACGACGCGTTCCTGTATG 



Hpyl8 8IX 
Dpnl | 
Sau3AI | j 
Alwl | j 1 
I Mi 



Alwl 
SfaNI 
BsaBI | 
Taql | | 
Maelll Dpnl | | | 

Taal Sau3AI | j j | 
I I III 



GATATACTTTTATGATCCGATTACTGTAACAGGATCGACATCTGTTGCTGATGCTCTCAA 

961 + + + + + + 1020 

CTATATGAAAATACTAGGCTAATGACATTGTCCTAGCTGTAGACAACGACTACGAGAGTT 
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Msel 
Vspl 
Sspl | CviJI 
I I I 



Drdll 
NlalV 
BscGI 
Bpml | 
BstZ17I | | 
BslI BsrI Sthl32I Accl| |! 

I 



Hpyl7 8III 
I 



TATTAATAGCCCTGATACTGGAGATAACAAAGAGTATACGGGAACCATAGTCTTTTCTGG 

10 21 + + + + + + 1080 

ATAATTATCGGGACTATGACCTCTATTGTTTCTCATATGCCCTTGGTATCAGAAAAGACC 



Alul Alul 
CviJI BplI 
Mnll CviJI Tsp5 0 9I 

Hin4I j Bpml | Acil MboII | 

j ! II III 

AGAGAAGCTCACGGAGGCAGAAGCTAAAGATGAGAAGAACCGCACTTCTAAATTACTTCA 

1081 + + + + + + H40 

TCTCTTCGAGTG C CTC CGTCTTCGATTTCTACTCTTCTTGG CGTG AAGATTTAATG AAGT 

BsmFI 

Dral Taal Dral | HphI 

Msel | Sfcl| Msel | j Msel 

II II III I 

AAATGTTGCTTTTAAAAATGGGACTGTAGTTTTAAAAGGTGATGTCGTTTTAAGTGCGAA 

1141 + + + + + + 1200 

TTTACAACGAAAATTTTTACCCTGACATCAAAATTTTCCACTACAGCAAAATTCACGCTT 



Hpyl78III 
SfaNI 
Ppil| 
Taal | | 
XmnI | | Ddel 
III I 



Fokl 
Ddel 
BseMII | 
Hin4I j j 

CviRI | j j 

II II 



Aatll 
BsaHI | CviRI 
Mae I I j BsmFI | 
II II 

CGGTTTCTCTCAGGATGCAAACTCTAAGTTGATTATGGATTTAGGGACGTCGTTGGTTGC 

1201 + + + + + + 1260 

GCCAAAGAGAGTCCTACGTTTGAGATTCAACTAATACCTAAATCCCTGCAGCAACCAACG 



Apol 

Tsp509I Plel Hpyl78III 

Hindi | Msel | Ddel | 

Tthlllll Hpal j Vspl j Sthl32l| | 

Taql|Mselj | Tsp509I | j Hinfl jj j BscGI 

II II I III I II I I 

AAACACCGAAAGTATCGAGTTAACGAATTTGGAAATTAATATAGACTCTCTCAGGAACGG 

1261 + + + + + + 1320 

TTTGTGGCTTTCATAGCTCAATTGCTTAAACCTTTAATTATATCTGAGAGAGTCCTTGCC 
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Fig. 2 (con't) 



Bbvl 
BseMII 1 



Hpyl88IX 
Ddel 
Alul | 
TspRI Cvi JI I 
Fnu4HI I BseMII | I 
Ddel Tsel| |MwoI | j j 
I III I Ml 



Dpnl 
Sau3AI | 
BseMII | | 

Acelll | | | 

I I 



GAAAAAGATAAAACTCAGTGCTGCCACAGCTCAGAAAGATATTCGTATAGATCGTCCTGT 

H + (. + + + 

CTTTTTCTATTTTGAGTCACGACGGTGTCGAGTCTTTCTATAAGCATATCTAGCAGGACA 



BsrI 
Tsp509I 

Rsal | Mnll Avail 

Tat I S Hin4I CviJI | Sau96I MslI 

III I II II 

TGTACTGGCAATTAGCGATGAGAGTTTTTATCAAAATGGCTTTTTGAATGAGGACCATTC 

381 + + + + + + 

ACATGACCGTTAATCGCTACTCTCAAAAATAGTTTTACCGAAAAACTTACTCCTGGTAAG 



Hpyl7 8III 
Smll ' 
SfaNI 
Hinf I 
Tf il 
Bed | 
Bsll| | 
Xcml | | | 

I Ml 



Hpyl78III 
Bce83I | 
I I 



Hinf I 
Tf il 
PstI | 
CviRI | j 
Sfcl | | j SfaNI 
I I II 



CTATGATGGGATTCTTGAGTTAGATGCTGGGAAAGACATCGTGATTTCTGCAGATTCTCG 

1441 + + + + + + 1500 

GATACTACCCTAAGAACTCAATCTACGACCCTTTCTGTAGCACTAAAGACGTCTAAGAGC 

Muni 

Rsal Tsp509I 
BsrGI | Dpnl | 

TatI CviJI Sau3AI | j AccI 

II I I I I I 

CAGTATAGATGCTGTACAATCTCCGTATGGCTATCAGGGAAAGTGGACGATCAATTGGTC 

1501 + + + + + + 1560 

GTCATATCTACGACATGTTAGAGGCATACCGATAGTCCCTTTCACCTGCTAGTTAACCAG 

Mnll 
TspRI 

BseMII Bpull02I | 
Alul Msel | Ddel j 

CviJI Taal Hin4I| | BtsI || 

II II I I M 

TACTGATGATAAGAAAGCTACGGTTTCTTGGGCGAAGCAGAGTTTTAATCCCACTGCTGA 

1561 + + + + + + 1620 

ATGACTACTATTCTTTCGATGCCAAAGAACCCGCTTCGTCTCAAAATTAGGGTGACGACT 
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BsaXI 
NlalV 
Cvi JI | MboII 
I I I 

GCAGGAGGCTCCGTTAGTTCCTAATCTTCTTTGGGGTTCTTTTATAGATGTTCGTTCCTT 

1621 + + + + + + 1680 

CGTCCTCCGAGGCAATCAAGGATTAGAAGAAACCCCAAGAAAATATCTACAAGCAAGGAA 



Apol 
Tsp509I 
Hpyl78III | 
I I 



Bf al 
Alul j 
CviJl | Rsal 
I I I 



BsiHKAI 
Bspl286I 
I 



EC057I 
I 



1681 



CCAGAATTTTATAGAGCTAGGTACTGAAGGTGCTCCTTACGAAAAGAGATTTTGGGTTGC 
GGTCTTAAAATATCTCGATCCATGACTTCCACGAGGAATGCTTTTCTCTAAAACCCAACG 



Hpyl78III 
BsiEI | 

BsaXI | j MslI 
Acil | j j Apol Mnll| 

Cac8I CviRI BsrBI j j j Tsp509I Nlalll j 

I I I II I I M 

AGGCATTTCCAATGTTTTGCATAGGAGCGGTCGTGAAAATCAAAGGAAATTCCGTCATGT 

1741 v + + h + + 1800 

TCCGTAAAGGTTACAAAACGTATCCTCGCCAGCACTTTTAGTTTCCTTTAAGGCAGTACA 



Sfcl 
BsaXI | 
I I 



MslI 
Sthl32I 
BssSI 
Cac8I 
CjePI 
Bf al | 
Mnll j 
Sf aNI j 
Nhel j j 
I I I 



Neil 
ScrFI 
Mspl | 

I I 



HaelV BsmAI 
Hin4I CjePI | 
Fokl | HphI | j 

I I III 



GAGTGGAGGTGCTGTAGTAGGTGCTAGCACGAGGATGCCGGGTGGTGATACCTTGTCTCT 



CTCACCTCCACGACATCATCCACGATCGTGCTCCTACGGCCCACCACTATGGAACAGAGA 



Acelll 
BseMII 
Maelll 
Tsp4 5I 
Hhal | 
Thai | 

Mwol I j Tsp5091 
I I I I 
GGGTTTTGCTCAGCTCTTTGCGCGTGACAAAGACTACTTTATGAATACCAATTTCGCAAA 

1861 + + + + + + 1920 

CCCAAAACGAGTCGAGAAACGCGCACTGTTTCTGATGAAATACTTATGGTTAAAGCGTTT 



Alul 
CviJI 
Bpull02I | 
Ddel j 
I I 



12/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Fig. 2 (con't) 



Fnu4HI 
CviRI 
Tsel 

Dpnl SfaNI | 
BstYI | Alwl | I 
Sau3AI I Maell j j | Bbvl 
II II 

GACCTACGCAGGATCTTTACGTTTGCAGCACGATGCTTCCCTATACTCTGTGGTGAGTAT 



CTGGATGCGTCCTAGAAATGCAAACGTCGTGCTACGAAGGGATATGAGACACCACTCATA 



BciVI 
Mnll 

HphI 
Mnll 



Dpnl 
BstYI 
Sau3AI 
Thai 
Acil | 
Alwl I 
Plel Hinfl | I 
I I 



Plel 
Bcefl | Hinfl 
I I I 

CCTTTTAGGAGAGGGAGGACTCCGCGAGATCCTGTTGCCTTATGTTTCCAAGACTCTGCC 

1981 + + + + + + 2040 

GGAAAATCCTCTCCCTCCTGAGGCGCTCTAGGACAACGGAATACAAAGGTTCTGAGACGG 



BsiHKAI 
Bspl286I 
I 



CviJI 
Haelll 
Bbvl 
Eael 
Gdill 
Alul 



CviJI 
Bpull02I 
Ddel 
Alul | 
CviJI j 
Fnu4HI | j 
Tsel| || 



Bbsl 
Hin4I 
Hinfl 
Tthllll 
Nlalll 
Alwl 



Bcefl | 

Dpnl | | 

Sau3AI j | | 

I I I 



GTGCTCTTTCTATGGGCAGCTTAGCTACGGCCATACGGATCATCGCATGAAGACCGAGTC 

2041 + + + + + + 2100 

CACGAGAAAGATACCCGTCGAATCGATGCCGGTATGCCTAGTAGCGTACTTCTGGCTCAG 



Taqll 
Plel | 
BstnAI | j 
MboII | | I 
I II I 



Dpnl 
Sau3AI 
Hgal 
Hpyl78lII | 
Taql | 
Sthl32I | | 
I I I 



Mnll 
Alwl | 
I I 



Acelll 
BsaXI | 
Hin4I j 
I I 



TCTACCCCCCCCCCCCCCGACGCTCTCGACGGATCATACTTCTTGGGGAGGATATGTCTG 

2101 + + + + + + 2160 

AGATGGGGGGGGGGGGGGCTGCGAGAGCTGCCTAGTATGAAGAACCCCTCCTATACAGAC 
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Fnu4HI 

TaqI Taul 
Aval | Acil | 

Alul Smllj MspAllj 
CviJI CviJI Xhol I Bpml Mnll | j 

I I II ! Ill 

GGCTGGAGAGCTGGGAACTCGAGTTGCTGTTGAAAATACCAGCGGCAGAGGATTTTTCCA 

2161 + + + + + + 2220 

CCGACCTCTCGACCCTTGAGCTCAACGACAACTTTTATGGTCGCCGTCTCCTAAAAAGGT 

Alul 

Rsal Alul Cac8I CviJI 

TatI | CviJI Mwol | Mwol | 

II I I I I I 

AGAGTACACTCCATTTGTAAAAGTCCAAGCTGTTTACGCTCGCCAAGATAGCTTTGTAGA 

2221 + + + + + + 2280 

TCTCATGTGAGGTAAACATTTTCAGGTTCGACAAATGCGAGCGGTTCTATCGAAACATCT 

BsaBI 

Alul Hinfl | CjePI Hinfl 

Bfal CviJI Hpyl78III Tfil j SfaNI Tfil 

II I II II 
ACTAGGAGCTATCAGTCGTGATTTTAGTGATTCGCATCTTTATAACCTTGCGATTCCTCT 

2281 + + + + + + 2340 

TGATCCTCGATAGTCAGCACTAAAATCACTAAGCGTAGAAATATTGGAACGCTAAGGAGA 

Mnll 

Hinfl | CviRI Bpml Tthlllll 

Tfil | CjePI Taal | Sspl Nlalll | Hin4I | 

II III I II II 

TGGAATCAAGTTAGAGAAACGGTTTGCAGAGCAATATTATCATGTTGTAGCGATGTATTC 

2341 + + + + + + 2400 

ACCTTAGTTCAATCTCTTTGCCAAACGTCTCGTTATAATAGTACAACATCGCTACATAAG 

BslI 

Maelll BsaJI | 

Hpyl78III Pflll08I | Rsal Mmel Styl j 

I II I I M 

TCCAGATGTTTGTCGTAGTAACCCCAAATGTACGACTACCCTACTTTCCAACCAAGGGAG 

2401 + + + + + + 2460 

AGGTCTACAAACAGCATCATTGGGGTTTACATGCTGATGGGATGAAAGGTTGGTTCCCTC 

BSU36I 

NspV Ddel 
TaqI CviJI 
MboII | Hael 
Mmel | j CviJI Haelll 

Bbsl| || Ddel Mwol | StuI | Acelll 

II II I II II I 

TTGGAAGACCAAAGGTTCGAACTTAGCAAGACAGGCTGGTATTGTTCAGGCCTCAGGTTT 

2461 + + + + + + 2520 

AACCTTCTGGTTTCCAAGCTTGAATCGTTCTGTCCGACCATAACAAGTCCGGAGTCCAAA 
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Fig. 2 (con't) 



- BseMII 
Dpnl 
Sau3AI 
Bbvl | 
Mill I I 
Taql| 
I I 



Hpyl78III 
Alul 
CviJI 
Sthl32I 
PstI 



Fnu4HI 
CviRI 
Tsel 
Fnu4HI 
Sf cl 
Alul | 
CviJI j 
Tsel j 
I 



Bbvl 
I 



Faul 
Sthl32I | 
CviJI | | 
I II 



Dpnl 
BstYI 
Sau3AI 
Acil | 
Mbollj j 



TCGATCTTTGGGAGCTGCAGCAGAGCTTTTCGGGAACTTTGGCTTTGAATGGCGGGGATC 

2521 + + + + + + 2580 

AGCTAGAAACCCTCGACGTCGTCTCGAAAAGCCCTTGAAACCGAAACTTACCGCCCCTAG 



Faul 
Sthl32I 
Sf aNI 
Alul | 
CviJI j 
Pflll08I | j 
Alwl | j j 
I I II 



Apol 

Acil Tsp509I SfaNI 

I I I 

TTCTCGTAGCTATAATGTAGATGCGGGTAGCAAAATCAAATTTTAGCGATTTCTCTTTCG 

2581 + + + + + + 2640 

AAGAGCATCGATATTACATCTACGCCCATCGTTTTAGTTTAAAATCGCTAAAGAGAAAGC 

CviJI Nlalll 

NlaIIl| BsaJI | BsaAI 

BsaJI | | BstDSI | HaelV 

BstDSI j | Ncol j Hin4I 

Ncol | | Dral Styl j SnaBI 

Styl jj Msel| CviJI | j Maell | 

I II II II I M 

ATGCTATTTTTCCATGGCTATTTTTAAAATGATAGCCATGGTTATAGATACGTAGTCCTT 

2641 + + + + + + 2700 

TACGATAAAAAGGTACCGATAAAAATTTTACTATCGGTACCAATATCTATGCATCAGGAA 



TaqI 
I 



CviRI 
MboII 
TspRI | 
Taal | | 
Bbsl| || 
II ■ 



Dpnl 
Sau3AI 
Hpyl88IX | 
Alwl | | 
Hin4I | | | 
II II 



ATTTCAAAGAAGACACTGTTGCATTAGATACGCTCTCTGATCCCTCAAAA 

2701 + + + + + 2750 

TAAAGTTTCTTCTGTGACAACGTAATCTATGCGAGAGACTAGGGAGTTTT 
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Figure 3: CPN1 00421 

ctcctgtccc tcgcgttgtc aacctacccc tcctccctcg aattctaatc ctttgaacgt 60 

agtacaacag cctgttgctg catcgtcagt gccttcctac atg ccc cca ctg aat 115 

Met Pro Pro Leu Asn 



get gat gat gtt etc cct aga gac cat ctg tea gat gga agt ttc tea 

Ala Asp Asp Val Leu Pro Arg Asp His Leu Ser Asp Gly Ser Phe Ser 

10 15 20 

gat acg tat cca gac att aca acg caa gcg ate ate tta att ttc ttg 

Asp Thr Tyr Pro Asp lie Thr Thr Gin Ala lie lie Leu lie Phe Leu 

25 30 35 

gee eta teg cct ttc ctg gtc atg ttg etc act teg tat eta aag att 

Ala Leu Ser Pro Phe Leu Val Met Leu Leu Thr Ser Tyr Leu Lys lie 



ate att act tta gtc tta tta cgt aac gee tta gga gta caa caa aca 
lie lie Thr Leu Val Leu Leu Arg Asn Ala Leu Gly Val Gin Gin Thr 



cct ccc agt caa gtc etc aat ggg att gca etc ate eta tct att tat 
Pro Pro Ser Gin Val Leu Asn Gly lie Ala Leu lie Leu Ser lie Tyr 



gtg atg ttc ccc acg gga gtg get atg tat aaa gat get cgc aag gaa 

Val Met Phe Pro Thr Gly Val Ala Met Tyr Lys Asp Ala Arg Lys Glu 
90 95 100 

ate gaa gee aat acc att cct caa age etc ttc act gca gaa ggt get 

lie Glu Ala Asn Thr He Pro Gin Ser Leu Phe Thr Ala Glu Gly Ala 
105 110 115 

gaa aca gtg ttt gtc get tta aac aaa tct aaa gaa cct ttg cgc tct 

Glu Thr Val Phe Val Ala Leu Asn Lys Ser Lys Glu Pro Leu Arg Ser 
120 125 130 

ttc tta att cgc aac act cca aaa gca caa att caa age ttt tac aag 

Phe Leu He Arg Asn Thr Pro Lys Ala Gin He Gin Ser Phe Tyr Lys 

135 140 145 

ate tea cag aaa acc ttc cct teg gaa att cga gcg cac etc act gee 

He Ser Gin Lys Thr Phe Pro Ser Glu He Arg Ala His Leu Thr Ala 
150 155 160 165 

tec gac ttt gta ate att att cct get ttt att atg ggt cag ata aaa 

Ser Asp Phe Val He lie He Pro Ala Phe He Met Gly Gin He Lys 
170 175 180 

aat get ttc gaa att gga gtc ttg ate tat eta cct ttc ttt gtt att 

Asn Ala Phe Glu He Gly Val Leu He Tyr Leu Pro Phe Phe Val He 
185 190 195 
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gat tta gtg act get aac gtt ctt gta gcg atg cag atg atg atg tta 73 9 
Asp Leu Val Thr Ala Asn Val Leu Val Ala Met Gin Met Met Met Leu 
200 205 210 

tec cct eta teg att teg tta cct tta aag tta ctt ttg ate gtc atg 787 
Ser Pro Leu Ser lie Ser Leu Pro Leu Lys Leu Leu Leu lie Val Met 
215 220 225 

gta gac gga tgg aca tta ctg etc caa ggg ctt atg ate age ttt aaa 835 
Val Asp Gly Trp Thr Leu Leu Leu Gin Gly Leu Met lie Ser Phe Lys 
230 235 240 245 

taaggacacg tgccgtgtta gcatttttcg caactagttt caaatctgtt ctttttgagt 895 

actcctacca atcattatta cttattttga ttgtttcggc acctcccatc atcttagctt 955 

ecatagtegg gattatggtt gcgatcttcc aagccgcaac acaaa 1000 
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Figure 4 (RY-34) 

Restriction enzyme analysis of CP 100421 



BseRI 
Hindi | 
Thai Mnll | j BsaXI 
I III I 



Mnll 
Apol 
EcoRI 
Tsp509I 
Mnll [ 
Taql | | 
I I I 



CTCCTGTCCCTCGCGTTGTCAACCTACCCCTCCTCCCTCGAATTCTAATCCTTTGAACGT 
GAGGACAGGGAGCGCAACAGTTGGATGGGGAGGAGGGAGCTTAAGATTAGGAAACTTGCA 



Bbvl CviRI 
Rsal | Fnu4HI [ TspRI Nlalll BsmI 

TatI | j CviJI Tsel | [ Sf aNI | Nspl TspRI | 

I II I II I II I II 

AGTACAACAGCCTGTTGCTGCATCGTCAGTGCCTTCCTACATGCCCCCACTGAATGCTGA 



61 



TCATGTTGTCGGACAACGACGTAGCAGTCACGGAAGGATGTACGGGGGTGACTTACGACT 



Bfal 
Bsal | 
BsmAI 



I I 



Hpyl88IX 
And I | 
HaeIV| | 
Hin4l| j 
Bed | |Xcml|BccI 
III II I 



BseMII 
Hpyl78III 
BsaAI 
BsaBI 
Hpyl88IX SnaBI 
Ddel |MaeIIl 
I I M 



BciVI 
I 



TGATGTTCTCCCTAGAGACCATCTGTCAGATGGAAGTTTCTCAGATACGTATCCAGACAT 
ACTACAAGAGGGATCTCTGGTAGACAGTCTACCTTCAAAGAGTCTATGCATAGGTCTGTA 



Dpnl 
Sau3AI |TspS09I 
Cac8I | | Msel | 

III II 



CviJI 
Haelll 
Sau96I | 
I I 



ScrFI 
EcoRI I | 
I I 



TACAACGCAAGCGATCATCTTAATTTTCTTGGCCCTATCGCCTTTCCTGGTCATGTTGCT 



ATGTTGCGTTCGCTAGTAGAATTAAAAGAACCGGGATAGCGGAAAGGACCAGTACAACGA 



BsaAI 
Maelll 

SnaBI Bsu36I Rsal 
Maell| Ddel TatI | 
II III 
CACTTCGTATCTAAAGATTATCATTACTTTAGTCTTATTACGTAACGCCTTAGGAGTACA 

241 + + + + + + 300 

GTGAAGCATAGATTTCTAATAGTAATGAAATCAGAATAATGCATTGCGGAATCCTCATGT 
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Mnll 
Tthlllll 
Bmrl I CviRI 

BsrI I I Fokl Mnll | Sthl32I 

I I I I I I I 

ACAAACACCTCCCAGTCAAGTCCTCAATGGGATTGCACTCATCCTATCTATTTATGTGAT 

301 + + + + + + 360 

TGTTTGTGGAGGGTCAGTTCAGGAGTTACCCTAACGTGAGTAGGATAGATAAATACACTA 



SfaNI 
CviJI 
BstXI 
MslI I 



BscGI I 
BsaJI I I 
BstDSI j I 
I I 



TaqI 
CjePI I CviJI 
I I I 



Bcgl 



Bcgl CacSI 
I I 

GTTCCCCACGGGAGTGGCTATGTATAAAGATGCTCGCAAGGAAATCGAAGCCAATACCAT 

361 + + + + + + 420 

CAAGGGGTGCCCTCACCGATACATATTTCTACGAGCGTTCCTTTAGCTTCGGTTATGGTA 



Tthlllll 
BstAPI 
PstI 
TspRI 
CviRI 
Mnll 
BtsI 
Sf cl 
Earl 
CjePI 
Mnll j 

CviJI I I I I I I I I TspRI Dral 

MboII I j ill I |MwoI I Taal | Msel | 

I I I Ml I I I I I I II 

TCCTCAAAGCCTCTTCACTGCAGAAGGTGCTGAAACAGTGTTTGTCGCTTTAAACAAATC 

421 + + + + + + 480 

AGGAGTTTCGGAGAAGTGACGTCTTCCACGACTTTGTCACAAACAGCGAAATTTGTTTAG 



Alul 
CviJI 
Hindi I I I 

Tsp509I Apol I j 

Hhal Msel I Bsbl Tsp509I j j 

III I III 

TAAAGAACCTTTGCGCTCTTTCTTAATTCGCAACACTCCAAAAGCACAAATTCAAAGCTT 

4al + + + + + + 540 

ATTTCTTGGAAACGCGAGAAAGAATTAAGCGTTGTGAGGTTTTCGTGTTTAAGTTTCGAA 
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Dpnl 
Bglll | 
BstYI j 
Sau3AI j 
I I 



Apol 

Tsp509I Hhal 
XmnI Hpyl8 8IX |TaqI Hin4I 

I I I I I 

TTACAAGATCTCACAGAAAACCTTCCCTTCGGAAATTCGAGCGCACCTCACTGCCTCCGA 

54X + + + + + + 600 

AATGTTCTAGAGTGTCTTTTGGAAGGGAAGCCTTTAAGCTCGCGTGGAGTGACGGAGGCT 



Hpyl8 8IX 
Mnll | 
TspRI | j 
Btsl | | | 
I I I 



Tsp509I 

Mnll Hpyl8 8IX NspV | 

BplI I Mmel SimI | TaqI j Hinfl 

II I II III 

CTTTGTAATCATTATTCCTGCTTTTATTATGGGTCAGATAAAAAATGCTTTCGAAATTGG 

601 + + + + + + 660 

GAAACATTAGTAATAAGGACGAAAATAATACCCAGTCTATTTTTTACGAAAGCTTTAACC 

Dpnl 
Plel 

Sau3AI | SfaNI 
Hpyl78IIl| j Maelll Acll | 

Hin4l|| j Tsp45I Mae I I j 

III I III 
AGTCTTGATCTATCTACCTTTCTTTGTTATTGATTTAGTGACTGCTAACGTTCTTGTAGC 

6S1 + + + + + + 720 

TCAGAACTAGATAGATGGAAAGAAACAATAACTAAATCACTGACGATTGCAAGAACATCG 

Maelll 
Mnll | Maelll 
Clal | | Dral | Dpnl 

CviRI TaqI j j Msel | j Sau3AI | 

I I I I II I M 

GATGCAGATGATGATGTTATCCCCTCTATCGATTTCGTTACCTTTAAAGTTACTTTTGAT 

721 + + + + + + 780 

CTACGTCTACTACTACAATAGGGGAGATAGCTAAAGCAATGGAAATTTCAATGAAAACTA 



Beef I 
Dral 
Msel 
Alul 
CviJI 
Dpnl 
Bell 



AccI 
Nlalll | 
I I 



Sau3AI 
CviJI 
Mwol 
BsaJI | 
Styl | 
Fokl | | 
I I I 



CGTCATGGTAGACGGATGGACATTACTGCTCCAAGGGCTTATGATCAGCTTTAAATAAGG 



GCAGTACCATCTGCCTACCTGTAATGACGAGGTTCCCGAATACTAGTCGAAATTTATTCC 
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Fig. 4 (con't) 



BsaAI 

Pmll Rsal 

Maell | Bf al Seal 

Afllll I Mwol Mwol Spel| TatI | 

I II I I II M 

ACACGTGCCGTGTTAGCATTTTTCGCAACTAGTTTCAAATCTGTTCTTTTTGAGTACTCC 

841 + + + + + + 900 

- TGTGCACGGCACAATCGTAAAAAGCGTTGATCAAAGTTTAGACAAGAAAAACTCATGAGG 



SChl32I 

Alul 
CviJI 

NlalV Ddel | 

BanI | Bed Mnll | | 
II I II I 

TACCAATCATTATTACTTATTTTGATTGTTTCGGCACCTCCCATCATCTTAGCTTCCATA 

901 + + + + + + 960 

ATGGTTAGTAATAATGAATAAAACTAACAAAGCCGTGGAGGGTAGTAGAATCGAAGGTAT 



Dpnl 

Hpyl7 8III Sau3AI | 

BslI | MboII | I 

II 



Acil 
Fnu4HI 
Taul 
CviJI | Bsbl 
I I I 



GTCGGGATTATGGTTGCGATCTTCCAAGCCGCAACACAAA 

951 + + + + 1000 

CAGCCCTAATACCAACGCTAGAAGGTTCGGCGTTGTGTTT 
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Figure 5: 

tagctttata caaagtatag aaaaataaca cgacaataaa aggagcggtg ttttctcttc 60 

tgaggtaaat cagcctcaaa gatactacgc catagtaaag atg aag ttt ttt age 115 

Met Lys Phe Phe Ser 



tta att ttt aaa gat gat gat gtc tec cca aat aag aag gtt tta tct 
Leu lie Phe Lys Asp Asp Asp Val Ser Pro Asn Lys Lys Val Leu Ser 



cct gaa get ttc tct get ttc ctt gat gec aaa gag ctg tta gaa aaa 
Pro Glu Ala Phe Ser Ala Phe Leu Asp Ala Lys Glu Leu Leu Glu Lys 



aca aaa gec gat age gaa gee tat gtt gca gag aca gaa caa aag tgt 
Thr Lys Ala Asp Ser Glu Ala Tyr Val Ala Glu Thr Glu Gin Lys Cys 



gca caa att cgt caa gaa get aaa gat caa gga ttt aaa gag gga tct 
Ala Gin lie Arg Gin Glu Ala Lys Asp Gin Gly Phe Lys Glu Gly Ser 



gaa tec tgg age aag caa att get ttc tta gaa gaa gaa act aaa aat 
Glu Ser Trp Ser Lys Gin lie Ala Phe Leu Glu Glu Glu Thr Lys Asn 



eta cgc ata aga gta cgc gag gec ttg gtt cct ctg gca att gcg agt 

Leu Arg lie Arg Val Arg Glu Ala Leu Val Pro Leu Ala lie Ala Ser 
90 95 100 

gtg agg aaa ate att ggg aag gaa etc gaa tta cat cct gaa act att 

Val Arg Lys lie lie Gly Lys Glu Leu Glu Leu His Pro Glu Thr lie 

105 110 115 

gtc tct att att tct caa gca ttg aaa gag etc aca caa aat aaa cat 

Val Ser lie lie Ser Gin Ala Leu Lys Glu Leu Thr Gin Asn Lys His 

120 125 130 

ate att ate tct gtc aat ccc aaa gat tta cct ctt gtt gag aaa agt 

lie lie He Ser Val Asn Pro Lys Asp Leu Pro Leu Val Glu Lys Ser 
135 140 145 

cgt cct gaa etc aag aac ate gtg gag tat get gac tec tta att ctt 

Arg Pro Glu Leu Lys Asn He Val Glu Tyr Ala Asp Ser Leu He Leu 
150 155 160 165 

aca gca aaa cct gat gtt act cct ggg ggt tgc att ate gag act gaa 

Thr Ala Lys Pro Asp Val Thr Pro Gly Gly Cys He He Glu Thr Glu 
170 175 180 

gca ggg ate ate aat gcg cag ctt gat gta caa tta gat gec tta gaa 

Ala Gly He He Asn Ala Gin Leu Asp Val Gin Leu Asp Ala Leu Glu 

185 190 195 
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aaa get ttc teg act ata eta aaa gcg aag aac cct gta gac gag cca 739 
Lys Ala Phe Ser Thr lie Leu Lys Ala Lys Asn Pro Val Asp Glu Pro 
200 205 210 

tct gag act tea tea tec acg gat tct tct tct tta tct aat gat cag 787 
Ser Glu Thr Ser Ser Ser Thr Asp Ser Ser Ser Leu Ser Asn Asp Gin 
215 220 225 

gat aag aaa gaa taaaggtatt cactattatg cgatccattt ttcgattttc 839 

Asp Lys Lys Glu 

230 

cctttgtttt tttacgctga gcgtctcatg ctgatttget gacgccagtc tatatgaaaa 899 
c 900 
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Figure 6 (RY-35) 

Restriction analysis of CPN1 00422 

BseMII 
MboII | Ddel 

Alul AciI I i Hpyl88IX 

CviJI BsrBI | | Mill I j 

| I I 1 I 1 

TAGCTTTATACAAAGTATAGAAAAATAACACGACAATAAAAGGAGCGGTGTTTTCTCTTC 

1 + + + + + + < 

ATCGAAATATGTTTCATATCTTTTTATTGTGCTGTTATTTTCCTCGCCACAAAAGAGAAG 

Tsp509I 
Msel | 
Alul | | 

Earl CviJI Mnll CviJI j j 

II! II! 

TGAGGTAAATCAGCCTCAAAGATACTACGCCATAGTAAAGATGAAGTTTTTTAGCTTAAT 

61 + + + + + + 

ACTCCATTTAGTCGGAGTTTCTATGATGCGGTATCATTTCTACTTCAAAAAATCGAATTA 

Alul 
CviJI 
Hindlll | SfaNI 
Hpyl78III | I Mwol | 

II II I I I M 

TTTTAAAGATGATGATGTCTCCCCAAATAAGAAGGTTTTATCTCCTGAAGCTTTCTCTGC 

1 + + + + + + : 

AAAATTTCTACTACTACAGAGGGGTTTATTCTTCCAAAATAGAGGACTTCGAAAGAGACG 

CviRI 

Eco57I Alul BsmAI | 

Acellll CviJI CviJI CviJI Mwol | 

II I I III 

TTTCCTTGATGCCAAAGAGCTGTTAGAAAAAACAAAAGCCGATAGCGAAGCCTATGTTGC 

1 + + + + + + 

AAAGGAACTACGGTTTCTCGACAATCTTTTTTGTTTTCGGCTATCGCTTCGGATACAACG 



Dral 
Msel | 



Hin4I BsmAI 



Apol 
Tsp509I 
BsiHKAI 
Bspl286I 
BseSI | 
CviRI | 
ApaLI | j 
' I ' 



Dpnl 
Sau3AI 
Alul j 
CviJI | 
Hpyl7 8III | | 

I 



Dral 
Msel | 
Mnll | j 
I I 



AGAGACAGAACAAAAGTGTGCACAAATTCGTCAAGAAGCTAAAGATCAAGGATTTAAAGA 
TCTCTGTCTTGTTTTCACACGTGTTTAAGCAGTTCTTCGATTTCTAGTTCCTAAATTTCT 
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Fig. 6 (con't) 



ScrFI 
EcoRII 
Alwl 
Hinf I 
Tf il 
BsaBI 



Hpyl88IX 
Dpnl | 
BstYI | I 
Sau3AI I | 

I I 



Tsp509I 
Cac8I | 
I I 



Tthlllll 
Bpml | 
Ddel | 
I I 



Mho 1 1 
MboII | 

I I 



301 



GGGATCTGAATCCTGGAGCAAGCAAATTGCTTTCTTAGAAGAAGAAACTAAAAATCTACG 
CCCTAGACTTAGGACCTCGTTCGTTTAACGAAAGAATCTTCTTCTTTGATTTTTAGATGC 



BsaJI 
Styl 
Cvi JI | 
Hael I 
Haelll j 
Thai | j 

Rsal | j | NlalV 

Mnll | j Stul|DrdIl| 
111 II II 



CjePI 
Mnll 
Mnll | 
Muni | j 
Tsp509I j j 
' I ' 



CATAAGAGTACGCGAGGCCTTGGTTCCTCTGGCAATTGCGAGTGTGAGGAAAATCATTGG 

! + + + + + + ' 

GTATTCTCATGCGCTCCGGAACCAAGGAGACCGTTAACGCTCACACTCCTTTTAGTAACC 

Hpyl78III 
CjePI | 
Tsp509I | j 
Fokl TaqI | j | Bce83I BsmAI Smll 

I I I I I I I I 

GAAGGAACTCGAATTACATCCTGAAACTATTGTCTCTATTATTTCTCAAGCATTGAAAGA 

1 + + + + + + ' 

CTTCCTTGAGCTTAATGTAGGACTTTGATAACAGAGATAATAAAGAGTTCGTAACTTTCT 



Banll 
BsiHKAI 
Bspl286I 
Sad 
Tthlllll 
Alul | 
CviJI j 

I I 1 
GCTCACACAAAATAAACATATCATTATCTCTGTCAATCCCAAAGATTTACCTCTTGTTGA 



Mnll 



481 



CGAGTGTGTTTTATTTGTATAGTAATAGAGACAGTTAGGGTTTCTAAATGGAGAACAACT 
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Fig. 6 (con't) 



Hpyl78III Tsp509I 
Bce83I Hpyl78III Smll | Plel Hinfl Msel | 

| I I I I I II 

GAAAAGTCGTCCTGAACTCAAGAACATCGTGGAGTATGCTGACTCCTTAATTCTTACAGC 



541 



CTTTTCAGCAGGACTTGAGTTCTTGTAGCACCTCATACGACTGAGGAATTAAGAATGTCG 



ScrFI 
BsaJI | 
ECORII | j 
Maelll 1 | I 

I III 



Hpyl78III 
BsmAI | 
CviRI |TaqI j 
II I 



Dpnl 
Sau3AI j 



Eco57I 
Fspl 



I I 



AAAACCTGATGTTACTCCTGGGGGTTGCATTATCGAGACTGAAGCAGGGATCATCAATGC 
TTTTGGACTAC AATGAGGAC C C C C AACGTAATAG CTCTG ACTTCGTCCCTAGTAG TTACG 



Alul 

CviJI 
Fnu4HI | 
Hhal | j 
Tselj j 
II I 



Tsp509I 
Bbvl ' 
Rsal 
BsrGI | 
SfaNI j 
Tat I | 
I 



Hpyl7 8III 
Alul | 
CviJI j 
Ddel Hindi I I | Taqi 
I III 



661 



GCAGCTTGATGTACAATTAGATGCCTTAGAAAAAGCTTTCTCGACTATACTAAAAGCGAA 
CGTCGAACTACATGTTAATCTACGGAATCTTTTTCGAAAGAGCTGATATGATTTTCGCTT 



Ddel 
Hpyl88IX 
Fokl 
BCCl 
BsmAI 
CviJI 



HaelV 
Hin4I 
BseMII 
MboII | 
Accl| | 
Sfcl || | 
I II I 



MboII 
BsaJI | 
BstDSI j 
MboII j 
I I 



GAACCCTGTAGACGAGCCATCTGAGACTTCATCATCCACGGATTCTTCTTCTTTATCTAA 

+ y. + + + + 

CTTGGGACATCTGCTCGGTAGACTCTGAAGTAGTAGGTGCCTAAGAAGAAGAAATAGATT 



Hpyl78III Dpnl 

Dpnl | Sau3AI j 

Bell | | Alwl | | 

Sau3AI | j Msll| | j Taqi 

III II I I I 

TGATCAGGATAAGAAAGAATAAAGGTATTCACTATTATGCGATCCATTTTTCGATTTTCC 

781 + + + + + + 840 

ACTAGTCCTATTCTTTCTTATTTCCATAAGTGATAATACGCTAGGTAAAAAGCTAAAAGG 
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Fig. 6 (con't) 



Nlalll 

Bpull02I BsmAI I BsrI 

Hgal | BsmBI | PshAI 

BseMII | Ddel Mwol | | BsaHI | Hgal 

I! I III III 

CTTTGTTTTTTTACGCTGAGCGTCTCATGCTGATTTGCTGACGCCAGTCTATATGAAAAC 

841 + + + + + + 900 

- GAAACAAAAAAATGCGACTCGCAGAGTACGACTAAACGACTGCGGTCAGATATACTTTTG 
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Figure 7: CPN 100424 

tgttcgcgat tggcactaat cccccctttt gttatggtga ataaaaaggt atgcgtggat 60 

tatggttcgt cgatctattt ctttttgctt gttctttcta atg aca ttg ctg tgc 115 

Met Thr Leu Leu Cys 
1 5 

tgt aca age tgt aac age agg tct eta att gtg cac ggt ctt cct ggc 163 

Cys Thr Ser Cys Asn Ser Arg Ser Leu lie Val His Gly Leu Pro Gly 
10 15 20 

aga gaa gcg aat gag att gtg gtg ctt ttg gta age aaa ggg gtg get 211 

Arg Glu Ala Asn Glu lie Val Val Leu Leu Val Ser Lys Gly Val Ala 
25 30 35 

gca caa aaa ttg cct caa get gca gcg get aca gee gga gca get act 259 

Ala Gin Lys Leu Pro Gin Ala Ala Ala Ala Thr Ala Gly Ala Ala Thr 
40 45 50 

gag caa atg tgg gat ate gcg gtt ccg tea gca caa ate aca gag gee 3 07 

Glu Gin Met Trp Asp lie Ala Val Pro Ser Ala Gin lie Thr Glu Ala 
55 60 65 

ctt gec att eta aat caa gcg ggt ctt cca cgt atg aaa ggg aca age 355 

Leu Ala lie Leu Asn Gin Ala Gly Leu Pro Arg Met Lys Gly Thr Ser 
70 75 80 85 

ctg tta gat ctt ttt gca aaa caa ggt ctt gtt cct tec gag ctt cag 403 

Leu Leu Asp Leu Phe Ala Lys Gin Gly Leu Val Pro Ser Glu Leu Gin 

90 95 100 

gaa aaa ate cgt tat caa gaa ggc tta tea gaa cag atg gec tct acg 451 

Glu Lys lie Arg Tyr Gin Glu Gly Leu Ser Glu Gin Met Ala Ser Thr 
105 110 115 

att aga aaa atg gat ggc gtt gtc gat gee tea gta cag att tec ttc 499 

lie Arg Lys Met Asp Gly Val Val Asp Ala Ser Val Gin lie Ser Phe 
120 125 130 

act aca gaa aat gaa gat aat ctt cct tta aca gee tct gtg tat att 547 

Thr Thr Glu Asn Glu Asp Asn Leu Pro Leu Thr Ala Ser Val Tyr lie 
135 140 145 

aag cat cga ggg gtt ttg gac aat ccg aac age att atg gtt tec aaa 595 

Lys His Arg Gly Val Leu Asp Asn Pro Asn Ser lie Met Val Ser Lys 
150 155 160 165 

att aag cgc ctt att gca agt get gtt cca gga ctt gtg cca gag aac 64 3 

lie Lys Arg Leu lie Ala Ser Ala Val Pro Gly Leu Val Pro Glu Asn 

170 175 180 

gtc tct gta gtg age gat cgc gca get tat agt gat att aca att aat 691 

Val Ser Val Val Ser Asp Arg Ala Ala Tyr Ser Asp lie Thr lie Asn 
185 190 195 
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ggt cct tgg gga tta aca gaa gaa ate gat tat gtt tct gtt tgg ggt 739 
Gly Pro Trp Gly Leu Thr Glu Glu lie Asp Tyr Val Ser Val Trp Gly 
200 205 210 

att att ctt gcg aag tct teg etc acc aaa ttc cgt etc att ttt tat 787 
lie lie Leu Ala Lys Ser Ser Leu Thr Lys Phe Arg Leu lie Phe Tyr 
215 220 225 

gtc ttg att etc att tta ttt gtt att tct tgt ggt etc ctt tgg gtc 835 
Val Leu lie Leu He Leu Phe Val He Ser Cys Gly Leu Leu Trp Val 
230 235 240 245 

att tgg aaa act cat act etc att atg act atg gga ggt aca aaa ggg 883 
He Trp Lys Thr His Thr Leu He Met Thr Met Gly Gly Thr Lys Gly 
250 255 260 

ttc ttc aac cct aca cca tat aca aag aat gec ttg gaa gee aag aaa 931 
Phe Phe Asn Pro Thr Pro Tyr Thr Lys Asn Ala Leu Glu Ala Lys Lys 
265 270 275 

gee gag gga gca get get gac aaa gag aaa aaa gaa gat gca gat tea 979 
Ala Glu Gly Ala Ala Ala Asp Lys Glu Lys Lys Glu Asp Ala Asp Ser 
280 285 290 

cag ggg gaa age aaa aat gcg gaa acc agt gat aaa gac tct agt gat 1027 
Gin Gly Glu Ser Lys Asn Ala Glu Thr Ser Asp Lys Asp Ser Ser Asp 
295 300 305 

aaa gat get cca gaa gga age aat gaa att gag ggt get tagtgactgc 1076 
Lys Asp Ala Pro Glu Gly Ser Asn Glu He Glu Gly Ala 
310 315 320 

caacactttt ggaactctag acatcttgat gaagcactcc aaggaagatg acctctccag 1136 
gtttcttcct aaaaatcttc ttgttgaatc tcctcatccc gaagaaatcc ctttaaaatc 1196 
ttta 1200 
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Figure 8 (RY-36) 

Restriction analysis of CPN 100424 



Hpyl78III 
Nrul 
Thai 



Hphl 

I I 
TGTTCGCGATTGGCACTAATCCCCCCTTTTGTTATGGTGAATAAAAAGGTATGCGTGGAT 

ACAAGCGCTAACCGTGATTAGGGGGGAAAACAATACCACTTATTTTTCCATACGCACCTA 



Tthlllll 
Dpnl | 
Sau3AI | I 
Drdll TaqI | j j 

I II ' ' 



Mwol 
Rsal | 
BsrGI | j 
BsrDI MslI TatI j j 

I I Mi 



TATGGTTCGTCGATCTATTTCTTTTTGCTTGTTCTTTCTAATGACATTGCTGTGCTGTAC 

ATACCAAGCAGCTAGATAAAGAAAAACGAACAAGAAAGATTACTGTAACGACACGACATG 

ScrFI 
EcoRII 
Taal 
BsiHKAI 
Bspl286I 
BseSI 
CviRI 
ApaLI 
Bbsl 
MboII 
I I 
I 



Maelll 
Alul | 
BspMI j 
CviJI j Mwol 
I I I 



BsmAI 
Tsp509I 
I 



AAGCTGTAACAGCAGGTCTCTAATTGTGCACGGTCTTCCTGGCAGAGAAGCGAATGAGAT 

12 i + + + + + + 180 

TTCGACATTGTCGTCCAGAGATTAACACGTGCCAGAAGGACCGTCTCTTCGCTTACTCTA 
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Fig. 8 (con't) 



Tsp509I 
CviRI | 



Sfcl 
CviJI | 
Fnu4HI 
Taul 
Acil | 
MspAlI j 
Mwol | 
PstI j 
Fnu4HI | 
Mnll j 
CviRI I 



Bce83I I | 

Fnu4HI j j 

Bbvl CviJI | j j 

Bsgl| Tselj | Bbvl j 

II M I II 

TGTGGTGCTTTTGGTAAGCAAAGGGGTGGCTGCACAAAAATTGCCTCAAGCTGCAGCGGC 

181 + + + + + + 240 

ACACCACGAAAACCATTCGTTTCCCCACCGACGTGTTTTTAACGGAGTTCGACGTCGCCG 



Tsel | 
Fnu4HI 
Sfcl 
Alul | 
Cvi JI j 
Smll Tsel j 
I I 



Ddel 
AlwNI j 
RleAI j 
Alul | j 
CviJI j | 
Fnu4HI | | | 
CjePI || || 
Mwol || || 
Tsel jj || 



BseMII |j| j j CjePI 

Mspl | | | | || NlalV | 

Bbvl | j j j j j I Acil | | 

CviJI j j j j I II Thai j | 

Mwol I III II Bbvl EcoRV | | | Mnll 

I II I II I II I I I I I I 
TACAGCCGGAGCAGCTACTGAGCAAATGTGGGATATCGCGGTTCCGTCAGCACAAATCAC 
241 + + + + + + 300 



SimI 

CviJI Acil | 

Haelll Bbsl | j 

CjePI | MboII j j BsaAI 

EcoO109l| Faul | j j CjePI | 

Sau96I Sthl32l| j j j Maell j CviJI 
II II I I I II I 
AGAGGCCCTTGCCATTCTAAATCAAGCGGGTCTTCCACGTATGAAAGGGACAAGCCTGTT 
301 + + + + + + 360 



TCTCCGGGAACGGTAAGATTTAGTTCGCCCAGAAGGTGCATACTTTCCCTGTTCGGACAA 
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Fig. 8 (con't) 

Dpnl 
Bglll 
BsmFI 
BstYI 
Sau3AI 



CviRI 
I 



Hpyl78III 
Alul | 
CviJI j 



Eco57I 
I 



Hpyl8 8IX 

! 



AGATCTTTTTGCAAAACAAGGTCTTGTTCCTTCCGAGCTTCAGGAAAAAATCCGTTATCA 
TCTAGAAAAACGTTTTGTTCCAGAACAAGGAAGGCTCGAAGTCCTTTTTTAGGCAATAGT 



Pf 111081 
CviJI 
Hael 

Haelll | Bed Fokl 

Bed | j Mnll SfaNI TaqI | 

Mil ! i I 

AGAAGGCTTATCAGAACAGATGGCCTCTACGATTAGAAAAATGGATGGCGTTGTCGATGC 

421 + + + + + + 480 

TCTTCCGAATAGTCTTGTCTACCGGAGATGCTAATCTTTTTACCTACCGCAACAGCTACG 



Hpyl8 8IX 
CviJI | 
I 



Rsal 

TatI j BseMII Msel AlwNI 

Ddel [ | Mnll | Sfcl MboII MboII j CviJI | 

I I I I I I ! II II 

CTCAGTACAGATTTCCTTCACTACAGAAAATGAAGATAATCTTCCTTTAACAGCCTCTGT 

481 + + + + + + 540 

GAGTCATGTCTAAAGGAAGTGATGTCTTTTACTTCTATTAGAAGGAAATTGTCGGAGACA 

Mnll 
CjePI | 

Msel j Msel 
Mnll j j TaqI SfaNI Hpyl88IX CjePI Tsp509I | 

III I I II II 

GTATATTAAGCATCGAGGGGTTTTGGACAATCCGAACAGCATTATGGTTTCCAAAATTAA 

541 + + + + + + 600 

CATATAATTCGTAGCTCCCCAAAACCTGTTAGGCTTGTCGTAATACCAAAGGTTTTAATT 



Haell Cjel 
Hhal | CviRI | 

I I 



Bsp24I 
CjePI 
Cjel| 
ScrFI | | 
EcoRII j | | 
MM 



Hin4I 
Mae 1 1 

I 



BsmAI 
BsmBI 
Cjel| 
Sfcl j 
I I 



Cjel 
CjePI 
Dpnl 
Bsp24I | 
Sau3AI | j 
I I I 



GCGCCTTATTGCAAGTGCTGTTCCAGGACTTGTGCCAGAGAACGTCTCTGTAGTGAGCGA 



CGCGGAATAACGTTCACGACAAGGTCCTGAACACGGTCTCTTGCAGAGACATCACTCGCT 
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Fig. 8 (con't) 



Alul 
CviJI 
Fnu4HI 
Hhal 
Tsel 
Thai 
Bpll 
BsiEI 
Pvul 
Sgf I 



BsaJI 
Styl 
Avail 
Sau96I 
Msel | 
Bbvl Vspl j 

21 |Tsp509I | j 
II ' 



Cjel 
Msel | 
I I 



Tthlllll 
Clal | 
Taql | 

I I 



TCGCGCAGCTTATAGTGATATTACAATTAATGGTCCTTGGGGATTAACAGAAGAAATCGA 

661 + + + + + + 

AGCGCGTCGAATATCACTATAATGTTAATTACCAGGAACCCCTAATTGTCTTCTTTAGCT 

Bsp24I 
CjePI 

Bbsl Cjel | Apol BsmAI 

MboII MboII HphI | j Tsp509I BsmBI 

I I III I I 

TTATGTTTCTGTTTGGGGTATTATTCTTGCGAAGTCTTCGCTCACCAAATTCCGTCTCAT 

721 + + + + + + 

AAT ACAAAGACAAAC C CCAT AATAAGAACGCTTCAGAAGCGAGTG GTTTAAGGCAG AGTA 



Hinf I 
Tfil 
Hpyl78III 
Cjel| 
CjePI | 
Bsp24I | | 
III 



Bsal 
BsmAI 



TTTTTATGTCTTGATTCTCATTTTATTTGTTATTTCTTGTGGTCTCCTTTGGGTCATTTG 



AAAAATACAGAACTAAGAGTAAAATAAACAATAAAGAACACCAGAGGAAACCCAGTAAAC 



MboII 

Mnll Rsal | 

I I I 

GAAAACTCATACTCTCATTATGACTATGGGAGGTACAAAAGGGTTCTTCAACCCTACACC 

841 + + + + + + 900 

CTTTTGAGTATGAGAGTAATACTGATACCCTCCATGTTTTCCCAAGAAGTTGGGATGTGG 
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Fig. 8 (con't) 



BsaJI 
CviJI | 

BsmI Bbvl | | 

BsaJI j CviJI Mwol | j j 
Styl|MwoI j Mnll | Ml 

II I I II Ml 

ATATACAAAGAATGCCTTGGAAGCCAAGAAAGCCGAGGGAGCAGCTGCTGACAAAGAGAA 

901 + + + + + + 960 

TATATGTTTCTTACGGAACCTTCGGTTCTTTCGGCTCCCTCGTCGACGACTGTTTCTCTT 



AlwNI 
Fnu4HI 

Alul 
CviJI 
MspAlI 
PvuII 
Tsel 
Fnu4HI [ 
Tselj | 
Mwol | j j 
I I 



SfaNI 
Bbvl | 
I 



MboII Hinfl 
Hinfl | Hin4I | 

Tfil | TspRI | |BfaI 

CviRI | | Acil BsrI Plel | j | Bpml 

III I I I I I I I 

AAAAGAAGATGCAGATTCACAGGGGGAAAGCAAAAATGCGGAAACCAGTGATAAAGACTC 

961 + + + + + + 1020 

TTTTCTTCTACGTCTAAGTGTCCCCCTTTCGTTTTTACGCCTTTGGTCACTATTTCTGAG 



SfaNI 

I 



BsrDI 
Tsp509I | 
Mnll | | 
■ II 



Maelll 
Tsp45I 
Ddel | 
I I 



Bsbl 
I 



TAGTGATAAAGATGCTCCAGAAGGAAGCAATGAAATTGAGGGTGCTTAGTGACTGCCAAC 

1021 + + + + + + 108 

ATCACTATTTCTACGAGGTCTTCCTTCGTTACTTTAACTCCCACGAATCACTGACGGTTG 



ScrFI 

Hpyl78III EcoRII 
Hpyl78III | Bpml HgiEII 

Bfal| j BsaJI | Hin4I | 

Xbal | j MslI Stylj CjePI | MboII |MnlI 

III I II I I I I I 

ACTTTTGGAACTCTAGACATCTTGATGAAGCACTCCAAGGAAGATGACCTCTCCAGGTTT 

1081 + + + + + + 1140 

TGAAAACCTTGAGATCTGTAGAACTACTTCGTGAGGTTCCTTCTACTGGAGAGGTCCAAA 



MboII 
I 



Hinfl 
CjePI | 
BseRI | j 
Fokl | Tfil 
I I 



Sthl32I 
Mnll | 
Hpyl78III | j 

I I I 



Oral 
MboII | 
Msel j 

II 



CTTCCTAAAAATCTTCTTGTTGAATCTCCTCATCCCGAAGAAATCCCTTTAAAATCTTTA 

1141 + + + + + + 1200 

GAAGGATTTTTAGAAGAACAACTTAGAGGAGTAGGGCTTCTTTAGGGAAATTTTAGAAAT 
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Figure 9: CPN10 042 6 

ttgaacccta tggaaatgta tcttatttgt gctgggctat atttcttaat gacaacatca 60 

ttttcctgta tttctaggtt atcagaaaag agaaggagtt atg aca att aga gtc 115 

Met Thr lie Arg Val 



cga aac ctt gcc tac tct gta aat aag aaa aag att eta gat ggt gta 
Arg Asn Leu Ala Tyr Ser Val Asn Lys Lys Lys lie Leu Asp Gly Val 
10 15 20 

act ttt tct tta gag cga ggg cac att aca ctg ttt gtt ggg aag agt 
Thr Phe Ser Leu Glu Arg Gly His lie Thr Leu Phe Val Gly Lys Ser 



ggt tea gga aaa aca atg att tta cgt get ttg gcg ggc tta gtc cag 

Gly Ser Gly Lys Thr Met lie Leu Arg Ala Leu Ala Gly Leu Val Gin 

40 45 50 

ccc act caa gga gat att tgg att gaa ggg gag get cca get eta gtt 

Pro Thr Gin Gly Asp lie Trp lie Glu Gly Glu Ala Pro Ala Leu Val 



ttc caa caa ccc gag tta ttt tec cat atg aca gta tta gga aat tgc 
Phe Gin Gin Pro Glu Leu Phe Ser His Met Thr Val Leu Gly Asn Cys 



acc cat cca caa ate cat ate aag ggt cgt agt acc gaa gaa get cga 

Thr His Pro Gin lie His lie Lys Gly Arg Ser Thr Glu Glu Ala Arg 
90 95 100 

gaa aag gcg ttc gag ctt tta cat ttg ttg gat att gaa gag gtt get 

Glu Lys Ala Phe Glu Leu Leu His Leu Leu Asp lie Glu Glu Val Ala 
105 110 115 

aag aat tat cct gac cag etc tct ggg gga caa aaa caa cgt gtg get 

Lys Asn Tyr Pro Asp Gin Leu Ser Gly Gly Gin Lys Gin Arg Val Ala 
120 125 130 

att gta cgt tct tta tgt atg gat aaa cat aca tta ctt ttt gat gaa 

lie Val Arg Ser Leu Cys Met Asp Lys His Thr Leu Leu Phe Asp Glu 
135 140 145 

cct aca teg get tta gat cct ttt get acg gca teg ttc cga cat ctt 

Pro Thr Ser Ala Leu Asp Pro Phe Ala Thr Ala Ser Phe Arg His Leu 

150 155 150 165 

tta gaa aca ctt cga gac cag gaa ctg act gta ggg tta act act cat 

Leu Glu Thr Leu Arg Asp Gin Glu Leu Thr Val Gly Leu Thr Thr His 
170 175 180 

gac atg caa ttt gtt cat agt tgt ttg gat cgt ate tat ctt ata gat 

Asp Met Gin Phe Val His Ser Cys Leu Asp Arg lie Tyr Leu He Asp 
185 190 195 
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Fig. 9 (con't) 

caa gga act gtt gcg ggg gtc tat gac aag cgt gac gga gag etc gat 73 9 
Gin Gly Thr Val Ala Gly Val Tyr Asp Lys Arg Asp Gly Glu Leu Asp 
200 205 210 

tct ggt cat cca tta teg aaa tat ate cac tct get caa taggactaca 788 
Ser Gly His Pro Leu Ser Lys Tyr lie His Ser Ala Gin 
215 220 225 

getgetagag cagctgtagt gatactttag aatcctgacc agtggcagga atgageggea 848 

tg 850 
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Figure 10 (RY-37) 

Restriction enzyme analysis of CPN 100426 

CviJI Msel 

I I 

TTGAACCCTATGGAAATGTATCTTATTTGTGCTGGGCTATATTTCTTAATGACAACATCA 

1 + + + + + + 60 

AACTTGGGATACCTTTACATAGAATAAACACGACCCGATATAAAGAATTACTGTTGTAGT 

Hpyl8 8IX 

Bfal Hpyl88IX Tsp509I Hinfl |PleI 

II I I I I 

TTTTCCTGTATTTCTAGGTTATCAGAAAAGAGAAGGAGTTATGACAATTAGAGTCCGAAA 

61 + + + + + + 120 

AAAAGGACATAAAGATCCAATAGTCTTTTCTCTTCCTCAATACTGTTAATCTCAGGCTTT 

Hpyl7 8III 
Bfal | 
Xbal| | 

Hinfl | | j Maelll Mill I 

CjePI Tfil j j | Bed | CjePl| 

I I III I I M 

CCTTGCCTACTCTGTAAATAAGAAAAAGATTCTAGATGGTGTAACTTTTTCTTTAGAGCG 

121 + + + + + + 180 

GGAACGGATGAGACATTTATTCTTTTTCTAAGATCTACCACATTGAAAAAGAAATCTCGC 

Bspl286I Hpyl78III Faul 

Tthlllll| Earl MboII Sth.l32l| 

Bmgl | j TspRI 1 Drdll | BsaAI | j 

BseSljj Taal | j Aid | j Cjel Maell j j j 

III III III I I I M 

AGGGCACATTACACTGTTTGTTGGGAAGAGTGGTTCAGGAAAAACAATGATTTTACGTGC 

181 + + + + + + 240 

TCCCGTGTAATGTGACAAACAACCCTTCTCACCAAGTCCTTTTTGTTACTAAAATGCACG 

Ddel 
Bce83I | 
CviJI j Cjel 
Cac8I | j CviJI BslI Mnll NlalV Alul 

Acil | II BspGI | Smll | Bpml |CjeI CviJI j CviJI 

I I I I II II III II I 

TTTGGCGGGCTTAGTCCAGCCCACTCAAGGAGATATTTGGATTGAAGGGGAGGCTCCAGC 

241 + + + + + + 300 

AAACCGCCCGAATCAGGTCGGGTGAGTTCCTCTATAAACCTAACTTCCCCTCCGAGGTCG 

Sthl32I 

Cjel | Mmel Tsp509I 

Bfal Acelll Aval | j Ndel | Taal Fokl | CviRI Bed 

I I III II I II I I 

TCTAGTTTTCCAACAACCCGAGTTATTTTCCCATATGACAGTATTAGGAAATTGCACCCA 

301 + + + + + + 360 

AGATCAAAAGGTTGTTGGGCTCAATAAAAGGGTATACTGTCATAATCCTTTAACGTGGGT 
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Fig. 10 (con't) 



Taqll BslI 
I 



Pf 111081 
SimI | 



| Rsal 
I I 



MboII 
Hpyl78III 
TaqI 
Aval | 
Smll I 
Xhol I 
Alul | I 
Cvi JI 1 j 
I I I 



Alul 
CviJI 
Cjel j 
Taql j 
Mme I | | 



TCCACAAATCCATATCAAGGGTCGTAGTACCGAAGAAGCTCGAGAAAAGGCGTTCGAGCT 

361 + + + + + + 420 

AGGTGTTTAGGTATAGTTCCCAGCATCATGGCTTCTTCGAGCTCTTTTCCGCAAGCTCGA 



Mnll 
Earl | 

I I 



Hpyl7 8III 
Tsp509I 
MboII | 
Ddel | j 
Cjel | | | 
I I I I 



BstXI 
Alul | 
CviJI |AceIII 
Hin4l| j Cjel | 
II I I I 



TTTACATTTGTTGGATATTGAAGAGGTTGCTAAGAATTATCCTGACCAGCTCTCTGGGGG 

421 + + + + + + 480 

AAATGTAAACAACCTATAACTTCTCCAACGATTCTTAATAGGACTGGTCGAGAGACCCCC 

BsmFI 

AflHI | Mae 1 1 

Mae I I j CviJI Rsal | Cjel 

III II 1 

ACAAAAACAACGTGTGGCTATTGTACGTTCTTTATGTATGGATAAACATACATTACTTTT 

481 + + + + + + 540 

TGTTTTTGTTGCACACCGATAACATGCAAGAAATACATACCTATTTGTATGTAATGAAAA 



Dpnl 
BstYI | 

Sau3AI | Bcefl 
Alwl | j SfaNI | 

CviJI j j Hpyl8 8IX| j 

III Ml 

TGATGAACCTACATCGGCTTTAGATCCTTTTGCTACGGCATCGTTCCGACATCTTTTAGA 

541 + + + + + + 600 

ACTACTTGGATGTAGCCGAAATCTAGGAAAACGATGCCGTAGCAAGGCTGTAGAAAATCT 



ScrFI 
ECORII 
Mme I 
Hpyl78III 
Taql 
Bsal | 
BsmAI j 
Xranl | 
I I 



Tthlllll 
Tsp509I 

CviRl 
Nlalll 
Nlalll j 
Hpyl78III 



Taal 
AlwNI SfCl| 
I I I 



Hindi 

Hpal | 
Msel| Real | 
II I I 

AACACTTCGAGACCAGGAACTGACTGTAGGGTTAACTACTCATGACATGCAATTTGTTCA 

601 + + 1- + + + 660 

TTGTGAAGCTCTGGTCCTTGACTGACATCCCAATTGATGAGTACTGTACGTTAAACAAGT 



Nspl 
I 
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Dpnl 
Sau3AI | 
I I 



Alwl 
I 



Faul 
Sthl32I | 
Dpnl | | 

Sau3AI | j | 

I I M 



Acil SimI 
I I 



TAGTTGTTTGGATCGTATCTATCTTATAGATCAAGGAACTGTTGCGGGGGTCTATGACAA 



ATCAACAAACCTAGCATAGATAGAATATCTAGTTCCTTGACAACGCCCCCAGATACTGTT 



Maelll 
Tsp45I 



Hinf I 
Tf il 
Banll 
BsiHKAI 
Bspl286I 
SacI 
Taql 
Alul | 
CviJT | 
Fokl | | 
II I 



Taql Bbvl 

.. - . I I 

GCGTGACGGAGAGCTCGATTCTGGTCATCCATTATCGAAATATATCCACTCTGCTCAATA 

721 + + + + + + 780 

CGCACTGCCTCTCGAGCTAAGACCAGTAGGTAATAGCTTTATATAGGTGAGACGAGTTAT 



Sfcl 
Alul 
CviJI 
MspAlI 
PvuII 
Fnu4HI 
Mwol 
Tsel 
Mwol 
Bfal 



Fnu4HI 
Alul | 
CviJI j 
MspAlI j 
PvuII j 
Sfcl Tsel j 
I I I 



Hpyl78III 
Hinf I | 
Tfil |BsrI 
I I 



TspRI 
BslI | 

I I 



GGACTACAGCTGCTAGAGCAGCTGTAGTGATACTTTAGAATCCTGACCAGTGGCAGGAAT 



CCTGATGTCGACGATCTCGTCGACATCACTATGAAATCTTAGGACTGGTCACCGTCCTTA 
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Fig. 10 (con't) 

Fnu4HI 
Taul 
Acil | 
BsrBI |NlaIII 
II I 
GAGCGGCATG 

841 + 850 

CTCGCCGTAC 
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Figure 11: CPNie0508 

ctctgattta tggtaattct ttattttcag agccgtcaag tcctttctat tctgttgaat 60 

ttcctaataa cgtaagtaat aaacaatcaa aagtccgcat atg aaa aga cct ttt 115 

Met Lys Arg Pro Phe 
1 5 

ttt acc tat eta tgc ate ate ttc tac gga tct tgt gca teg tta tct 163 

Phe Thr Tyr Leu Cys lie lie Phe Tyr Gly Ser Cys Ala Ser Leu Ser 
10 15 20 

tta cat gca gga etc tct ttc cca gaa gta cgt gga get acg get get 211 
Leu His Ala Gly Leu Ser Phe Pro Glu Val Arg Gly Ala Thr Ala Ala 



gtt gtc cat gec gac tct ggg aag gta ttc tat gat aaa gac ata gat 

Val Val His Ala Asp Ser Gly Lys Val Phe Tyr Asp Lys Asp lie Asp 

Val Val His Ala Asp Ser Gly Lys Val Phe Tyr Asp Lys Asp He Asp 
40 45 50 

get gta ate tat cct gec age atg acg aaa ate gca act gec etc ttt 

Ala Val He Tyr Pro Ala Ser Met Thr Lys He Ala Thr Ala Leu Phe 

Ala Val He Tyr Pro Ala Ser Met Thr Lys He Ala Thr Ala Leu Phe 

55 60 65 

ate eta aag cac tat ccc aca gtc etc gat act etc ate aaa gtc aaa 

He Leu Lys His Tyr Pro Thr Val Leu Asp Thr Leu He Lys Val Lvs 

He Leu Lys His Tyr Pro Thr Val Leu Asd Thr Leu He Lys Val Lys 

70 75 80 85 

caa gat gcg ate get tec ate act ccg caa gca aaa aaa caa tea gga 

Gin Asp Ala He Ala Ser He Thr Pro Gin Ala Lys Lys Gin Ser Gly 

Gin Asp Ala He Ala Ser lie Thr Pro Gin Ala Lys Lys Gin Ser Gly 

90 95 100 

tat cgt agt cct ccc cac tgg tta gaa act gat gga tct aca ata cag 

Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp Gly Ser Thr He Gin 

Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp Gly Ser Thr He Gin 
105 110 115 

etc cat ctt cga gaa gag ctt tta ggg tgg gac ctg ttc cac gec tta 

Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp Leu Phe His Ala Leu 

Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp Leu Phe His Ala Leu 
120 125 130 

ctg gtc tgt tct get aat gat get gcg aat gtc tta get atg gca tgt 
Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val Leu Ala Met Ala Cys 
Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val Leu Ala Met Ala Cys 
135 140 145 

tgc gga tct gta gag aag ttt atg gat aag ctg aac ttc ttc tta aaa 

Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu Asn Phe Phe Leu Lys 

Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu Asn Phe Phe Leu Lys 

150 155 160 165 

gaa gaa ate ggc tgc act cat acc cat ttt aat aat ccc cat ggg tta 
Glu Glu lie Gly Cys Thr His Thr His Phe Asn Asn Pro His Gly Leu 
Glu Glu He Gly Cys Thr His Thr His Phe Asn Asn Pro His Gly Leu 
170 175 130 
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cat cat ccg aat cac tat act aca acc cgt gat ctt att age a:: atg 691 

His His Pro Asn His Tyr Thr Thr Thr Arg Asp Leu lie Ser lie Met 

His His Pro Asn His Tyr Thr Thr Thr Arg Asp Leu He Ser He Met 
185 190 195 

cgt tgc get ctg aaa gaa cct cca ttt cga ggg gtc ate tec acg aca 739 

Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly Val lie Ser Thr Thr 

Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly Val He Ser Thr Thr 

200 205 210 



age tat aaa ata ggg get aca aac ctg cat ggc gaa egg ate eta tec 787 

Ser Tyr Lys He Gly Ala Thr Asn Leu His Gly Glu Arg He Leu Ser 

Ser Tyr Lys He Gly Ala Thr Asn Leu His Gly Glu Arg He Leu Ser 
215 220 225 



cca aca aac aaa 

Pro Thr Asn Lys 

Pro Thr Asn Lys 
230 



ttg ctt ctt cct 

Leu Leu Leu Pro 

Leu Leu Leu Pro 
235 



ggg tct acc tac 
Gly Ser Thr Tyr 
Gly Ser Thr Tyr 
240 



His Tyr Pro Pro 
His Tyr Pro Pro 
245 



get tta gga ggg aaa aca ggg acc acc aag act gca ggg aaa aat eta 88 3 

Ala Leu Gly Gly Lys Thr Gly Thr Thr Lys Thr Ala Gly Lys Asn Leu 

Ala Leu Gly Gly Lys Thr Gly Thr Thr Lys Thr Ala Giv Lys Asn Leu 
250 25= " 260 



att atg get get gaa aaa aat aac cgc etc ttg gta acg ate gca acg 931 

He Met Ala Ala Glu Lys Asn Asn Arg Leu Leu Val Thr He Ala Thr 

He Met Ala Ala Glu Lvs Asn Asn Arg Leu Leu Val Thr He Ala Thr 
265 270 275 



ggc tat teg ggt cct gtg agt gat etc tac caa gat gtc att get eta 979 

Gly Tyr Ser Gly Pro Val Ser Asp Leu Tyr Gin Asp Val lie Ala Leu 

Gly Tvr Ser Glv Pro 

280 " 285 290 



tgt gaa acg gta ttt aac gag ccg eta tta aga aaa gag etc gtc cec 1027 
Cys Glu Thr Val Phe Asn Glu Pro Leu Leu Arg Lys Glu Leu Val Pro 
295 300 305 



ccc tec gac tgt etc caa tta gaa 
Pro Ser Asp Cys Leu Gin Leu Glu 
310 315 

tgc cct ctt cct gag gga etc tac 
Cys Pro Leu Pro Glu Gly Leu Tyr 
330 



ata gcg aat ctt ggg aag ctt tct 1075 
He Ala Asn Leu Gly Lys Leu Ser 
320 325 

tat gac ttc tat gee tec gaa gat 1123 
Tyr Asp Phe Tyr Ala Ser Glu Asp 
335 340 



cgc gaa cct ctt tct gta tct ttt att gca cat gcg gac gec ttc cct 

Arg Glu Pro Leu Ser Vai Ser Phe He Ala His Ala Aso Ala Phe Pro 

345 350 " 355 

att gaa caa gga gat ctt ctt ggt cat tgg gtt ttt tat gac gat gaa 

He Glu Gin Gly Asp Lau Leu Gly His Trp Val Phe Tyr Aso Aso Glu 

360 * 365 " 370 
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Fig. 1 1 (con't) 

ggc aag aaa att tct tec cag cct ttc tat gec cct tgt cgt ttt gag 1267 

Gly Lys Lys lie Ser Ser Gin Pro Phe Tyr Ala Pro Cys Arg Phe Glu 
375 380 385 

cgc act ate aag cct tgg aaa etc tat atg aaa cgt gtc ttc aca teg 1315 
Arg Thr lie Lys Pro Trp Lys Leu Tyr Met Lys Arg Val Phe Thr Ser 
390 39S 400 405 

tat aga ace tat atg tct ata ace atg ctg etc atg tat ttt cgc ate 1363 
Tyr Arg Thr Tyr Met Ser lie Thr Met Leu Leu Met Tyr Phe Arg lie 
410 415 420 

cgc aag cac cgc aag tat aaa aat tta aaa cac tat tct aaa ate 1408 
Arg Lys His Arg Lys Tyr Lys Asn Leu Lys His Tyr Ser Lys lie 
425 430 435 

taactttttc ttttaattta taaaaaacca aaggtttatg taagatttgc gcttttcaat 1468 

ccaacaagaa tcccttgtgc gcacattact tt 1500 
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Figure 12 (RY-39) 

Restriction enzyme analysis of CPN1 00508 

Bcefl CviJI Apol 

Hpyl88IX Tsp509I j HpylSSIX | Tsp509I 

I I I 1 I I 

CTCTGATTTATGGTAATTCTTTATTTTCAGAGCCGTCAAGTCCTTTCTATTCTGTTGAAT 

! + + + + + + 

. GAGACTAAATACCATTAAGAAATAAAAGTCTCGGCAGTTCAGGAAAGATAAGACAACTTA 

Ndel 

Mae I I Acil | 

I i I 



AAGGATTATTGCATTCATTATTTGTTAGTTTTCAGGCGTATACTTTTCTGGAAAAAAATG 



Nsil 
CviRI | 
MboII | | 
I I I 



Dpnl 
BstYI j 
Sau3AI j 
SfaNI | j 

I 1 



Hinf I 
CviRI 
Nlalll 
Nspl 
Plel | 
I I 



CviRI 

Alwl| SfaNI 

II I 

CTATCTATGCATCATCTTCTACGGATCTTGTGCATCGTTATCTTTACATGCAGGACTCTC 

121 + + + + + + 180 

GATAGATACGTAGTAGAAGATGCCTAGAACACGTAGCAATAGAAATGTACGTCCTGAGAG 



CjePI 
BsaAI Fnu4HI 
Bbvl j CviJI j 

Maellj Alul Mwol | 
Rsal | j CviJI Tsel j 
III I II 



Hinf I 
Xcml 
Nlalll | 
Bcefl | j 
Plel | | |BslI 
II I I I 



TTTCCCAGAAGTACGTGGAGCTACGGCTGCTGTTGTCCATGCCGACTCTGGGAAGGTATT 
AAAGGGTCTTCATGCACCTCGATGCCGACGACAACAGGTACGGCTGAGACCCTTCCATAA 



SfaNI 
CjePI | 



Nlalll 
Cac8I | 

I I I 

CTATGATAAAGACATAGATGCTGTAATCTATCCTGCCAGCATGACGAAAATCGCAACTGC 

241 + + + + + + 300 

GATACTATTTCTGTATCTACGACATTAGATAGGACGGTCGTACTGCTTTTAGCGTTGACG 

Mnll 

Mnll Taal TaqI RleAI | SfaNI 

I I I II I 

CCTCTTTATCCTAAAGCACTATCCCACAGTCCTCGATACTCTCATCAAAGTCAAACAAGA 

301 + + + + + + 360 

GGAGAAATAGGATTTCGTGATAGGGTGTCAGGAGCTATGAGAGTAGTTTCAGTTTGTTCT 
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Tthlllll 
BsiEI 
Pvul 
Sgfl 
Dpnl | 
U3AI | | 
I II 



I I 



Pf 111081 
EcoRV | 



Hpyl78III 
Tthlllll | 
I I 



TGCGATCGCTTCCATCACTCCGCAAGCAAAAAAACAATCAGGATATCGTAGTCCTCCCCA 



Hpyl78III 
Acelll 

Dpnl Alul TaqI 

BsrI BstYI | CviJI Earl | 

TspRI Sau3AI j MboII | Sapl j 

Mnll | Bed j j Alwl | |BccI | | 

I I II I I I I I II 

CTGGTTAGAAACTGATGGATCTACAATACAGCTCCATCTTCGAGAAGAGCTTTTAGGGTG 



Alul 
CviJI 



GACCAATCTTTGACTACCTAGATGTTATGTCGAGGTAGAAGCTCTTCTCGAAAATCCCAC 



NlalV 
Avail | 
Eco0109I j 
Psp5II j 
Sau96I | 
I I 



Bbvl 
Sf aNI 
BsrI | 



Fnu4HI 
Tsel | 
Mwol | j 

I I I 



Alul 
CviJI 
Ddel | 
I I 



GGACCTGTTCCACGCCTTACTGGTCTGTTCTGCTAATGATGCTGCGAATGTCTTAGCTAT 



CCTGGACAAGGTGCGGAATGACCAGACAAGACGATTACTACGACGCTTACAGAATCGATA 



Sfcl 
Dpnl 
BstYI j 
Sau3AI j 

Acil | j | Alul Bbvl 

Nlalll Mil CviJI Bsgl | 

Nspl j j j j Alwl MboII Xnml Msel | | 

I I I I I I I I Ml 

GGCATGTTGCGGATCTGTAGAGAAGTTTATGGATAAGCTGAACTTCTTCTTAAAAGAAGA 

541 h + h + + v 600 

CCGTACAACGCCTAGACATCTCTTCAAATACCTATTCGACTTGAAGAAGAATTTTCTTCT 
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CviRI 
MboII 
Fnu4HI | 
Cvi JI | | 
Tsel | | 
I I I 



Msel 
I 



Maelll 
Nlalll 
BsaJI 
BstDSI 
Ncol 
Styl 
Fokl | 
I I 



Hinf I 
Tf il 
Hpyl8 8IX | 



AATCGGCTGCACTCATACCCATTTTAATAATCCCCATGGGTTACATCATCCGAATCACTA 
TTAGCCGACGTGAGTATGGGTAAAATTATTAGGGGTACCCAATGTAGTAGGCTTAGTGAT 



Sthl32I 
Dpnl | 
Sau3AI | | 
BscGI | | | 
I 



Hpyl88IX 

SfaNI | 
Nlalll | Hhal | 
I ' 



Mnll 
BslI | 
Mnll TaqI | | 
I III 



TACTACAACCCGTGATCTTATTAGCATCATGCGTTGCGCTCTGAAAGAACCTCCATTTCG 

1 + + + + + + 

ATGATGTTGGGCACTAGAATAATCGTAGTACGCAACGCGAGACTTTCTTGGAGGTAAAGC 

Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl 
BspMI | 

Hin4I Alul Nlalll | | 

SimI CviJI CviJI CviRI | | | 

I I | I III 

AGGGGTCATCTCCACGACAAGCTATAAAATAGGGGCTACAAACCTGCATGGCGAACGGAT 

! + + + + + + 

TCCCCAGTAGAGGTGCTGTTCGATATTTTATCCCCGATGTTTGGACGTACCGCTTGCCTA 



MboII 
Tsp509I 



ACCl 
SimI 
ScrFI 
BsaJI | 
EcoRII | | 
Tthlllll | I 
I I I 



BslI 
Alul | 
CviJI I 
Mnll 



Alwl 
I 

CCTATCCCCAACAAACAAATTGCTTCTTCCTGGGTCTACCTACCACTATCCCCCAGCTTT 
GGATAGGGGTTGTTTGTTTAACGAAGAAGGACCCAGATGGATGGTGATAGGGGGTCGAAA 



NlalV 
Avail | 
Sau96I | 
I I 



PStI 
CviRI I 
BsmFI | | 
Sfcl|| | 

Mi 



Tsp509I 
Bbvl | 



Fnu4HI 
CviJI | 
Tsel | 
I I 



AGGAGGGAAAACAGGGACCACCAAGACTGCAGGGAAAAATCTAATTATGGCTGCTGAAAA 

841 + + + + + + 900 

TCCTCCCTTTTGTCCCTGGTGGTTCTGACGTCCCTTTTTAGATTAATACCGACGACTTTT 
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CviJI 
BscGI 
Sthl32I 
BsiEI 
Pvul 
Dpnl 



Sau3AI 
Sthl32I 
Maelll | 
Acil BslI |Mnllj 
II II 



NlalV 
Avail j 
ECOO109I | 
Psp5II | 
Sau96I | 
SimI | 
I I 



Dpnl 
Sau3AI | 
I I 



AAATAACCGCCTCTTGGTAACGATCGCAACGGGCTATTCGGGTCCTGTGAGTGATCTCTA 

TTTATTGGCGGAGAACCATTGCTAGCGTTGCCCGATAAGCCCAGGACACTCACTAGAGAT 

Banll 
BsiHKAI 
Bspl286I 
Sad 
Alul | 
CviJI j 
I I 

CCAAGATGTCATTGCTCTATGTGAAACGGTATTTAACGAGCCGCTATTAAGAAAAGAGCT 

+ + h + + + 

GGTTCTACAGTAACGAGATACACTTTGCCATAAATTGCTCGGCGATAATTCTTTTCTCGA 



Taal Msel 



Acil 
Fnu4HI 

Taul BsmFI 
CviJI | Msel 
I I I 



BsmAI 
Tsp509I 
Mnll | 
Hpyl88IX Taal | j 
I III 



Hinf I 
Tf il 
Mmel | 
I I 



BseMII 
MboII 
Alul | 
CviJI j 
Hindlll | | 
I I I 



CGTCCCCCCCTCCGACTGTCTCCAATTAGAAATAGCGAATCTTGGGAAGCTTTCTTGCCC 

1021 + + + + + + 1080 

GCAGGGGGGGAGGCTGACAGAGGTTAATCTTTATCGCTTAGAACCCTTCGAAAGAACGGG 



Hpyl78III 
Nrul 
Thai 
Mnll | 
Dpnl | j 
Sau3AI | | | 
BsmFI Hpyl88IX | j j j MboII Mnll 
I I I Ml I I 

TCTTCCTGAGGGACTCTACTATGACTTCTATGCCTCCGAAGATCGCGAACCTCTTTCTGT 

1081 + + + + + + 1140 

AGAAGGACTCCCTGAGATGATACTGAAGATACGGAGGCTTCTAGCGCTTGGAGAAAGACA 



Hinf I 
Mnll 
Bsu36I | 
Ddel j 
Earl j 
Hpyl78III j 
Mnll Plel j 
I I 
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Fig. 12 (con't) 



BsaHI 
Acil | 
Nlalll | 
CviRI Nspl j 
I I 



Hgal 
I 



Dpnl 
Bglll 
BstYI 
Sau3AI 
MboII | 



ATCTTTTATTGCACATGCGGACGCCTTCCCTATTGAACAAGGAGATCTTCTTGGTCATTG 
TAGAAAATAACGTGTACGCCTGCGGAAGGGATAACTTGTTCCTCTAGAAGAACCAGTAAC 



Apol 
Tsp509I 
MboII | CviJI 

I I I 
GGTTTTTTATGACGATGAAGGCAAGAAAATTTCTTCCCAGCCTTTCTATGCCCCTTGTCG 

CCAAAAAATACTGCTACTTCCGTTCTTTTAAAGAAGGGTCGGAAAGATACGGGGAACAGC 



BsaJI 
Styl 
CviJI | 
I I 



Af 1III 
Maell 
Bbsl | 
MboII | 

I I 



TTTTGAGCGCACTATCAAGCCTTGGAAACTCTATATGAAACGTGTCTTCACATCGTATAG 
AAAACTCGCGTGATAGTTCGGAACCTTTGAGATATACTTTGCACAGAAGTGTAGCATATC 



Bbvl 

I 



Nlalll 
Fokl 
Fnu4HI | 
Nlalll | | 
Tselj j 
I I 



Acil 
Mwol 
SfaNI | 
Cac8I | j 
Acil | | | 
I III 



AACCTATATGTCTATAACCATGCTGCTCATGTATTTTCGCATCCGCAAGCACCGCAAGTA 

1321 + + + + h- + 1380 

TTGGATATACAGATATTGGTACGACGAGTACATAAAAGCGTAGGCGTTCGTGGCGTTCAT 



Apol 

Tsp509I Dral Tsp509I 
Tthlllll |Msel| Msel | 

I I II M 
TAAAAATTTAAAAC ACT AT T CT AAAAT C T AACTTTTT CTTTT AAT TT AT AAAAAAC C AAA 

1381 + + + + + + 1440 

ATTTTTAAATTTTGTGATAAGATTTTAGATTGAAAAAGAAAATTAAATATTTTTTGGTTT 

CjePI 

CjePI Hinfl Hhal | 

Hhal | Tfil Fspl|Mmel| 

II I M U 

GGTTTATGTAAGATTTGCGCTTTTCAATCCAACAAGAATCCCTTGTGCGCACATTACTTT 

1441 + + + + + + 1500 

CCAAATACATTCTAAACGCGAAAAGTTAGGTTGTTCTTAGGGAACACGCGTGTAATGAAA 
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Figure 13: CPN100515- 



aaggagcaaa tggagattgg ccaaatagac gagcaagggt ttgcataaga atagcctttt 60 

tcgcaataat aacttgccta aacgatcttg taaacgactt atg get tct aat ccc 115 

Met Ala Ser Asn Pro 

1 5 

att tta cag ata gag gat eta tec ata ace ttg gca aaa caa cgc caa 163 

lie Leu Gin lie Glu Asp Leu Ser lie Thr Leu Ala Lys Gin Arg Gin 

lie Leu Gin lie Glu Asd Leu Ser lie Thr Leu Ala Lys Gin Arg Gin 
10 15 20 



cag tac ccc ate gtc caa tct tta teg ttt act ate aat gaa gga caa 211 

Gin Tyr Pro lie Val Gin Ser Leu Ser Phe Thr lie Asn Glu Gly Gin 

Gin Tyr Pro lie Val Gin Ser Leu Ser Phe Thr lie Asn Glu Gly Gin 

25 30 35 



acc tta gca ate att gga gaa tea gga tea gga aaa tct gtc tct gcg 25 9 

Thr Leu Ala He He Gly Glu Ser Gly Ser Gly Lys Ser Val Ser Ala 

Thr Leu Ala He lie Gly Glu Ser Gly Ser Glv Lys Ser Val Ser Ala 
40 45 50 



cat gca ate ctt cga tta ctt cct tgc ccc cca ttt tct gtt tct ggc 3C7 

His Ala He Leu Arg Leu Leu Pro Cys Pro Pro Phe Ser Val Ser Gly 

His Ala He Leu Arg Leu Leu Pro Cys Pro Pro Phe Ser Val Ser Glv 

55 60 65 



cag gtc aac ttc caa ggc cac aac tta ctt acg get teg cgc tct ata 

Gin Val Asn Phe Gin Gly His Asn Leu Leu Thr Ala Ser Arg Ser lie 

Gin Val Asn Phe Gin Gly His Asn Leu Leu Thr Ala Ser Arg Ser lie 

70 75 80 85 



caa aaa aag att ata ggg aca gaa att tct atg ate ttt caa aac ccg 403 

Gin Lys Lys He He Gly Thr Glu lie Ser Met lie Phe Gin Asn Pro 

Gin Lys Lys He He Gly Thr Giu He Ser Met lie Phe Gin Asn Pro 
90 95 100 



caa gca tct eta aac ccc gtg ttt act att gaa cag cag ttt cga gaa 
Gin Ala Ser Leu Asn Pro Val Phe Thr lie Glu Gin Gin Phe Arg Glu 
Gin Ala Ser Leu Asn Pro Val Phe Thr lie Glu Gin Gin Phe Arg Glu 
105 110 115 

att att cat acc cac eta gec tta act gca gaa gtt get aaa gaa aag 

He lie His Thr His Leu Ala Leu Thr Ala Glu Val Ala Lys Glu Lys 

He lie His Thr His Leu Ala Leu Thr Ala Glu Val Ala Lys Glu Lys 

120 125 130 

atg tta tac get ctt gaa gaa aca ggg ttt cat gat ccc agg ctg tgc 

Met Leu Tyr Ala Leu Giu Giu Thr Gly Phe His Asp Pro Arg Leu Cys 

Met Leu Tyr Ala Leu Giu Giu Thr Gly Phe His Aso Pro Arg Leu Cys 

135 140 145 



ttg aat etc tac 

Leu Asn Leu Tyr 

Leu Asn Leu Tyr 
150 



ccc cac caa etc 

Pro His Gin Leu 

Pro His Gin Leu 
155 



tot gga ggg atg 
Ser Gly Gly Met 
Ser Gly Gly Met 
160 



Leu Gin Arg lie 
Leu Gin Arg He 
165 
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tgc att gcc atg gcg etc etc tgt tct cct aaa ctt ctt att get gat 643 

Cys He Ala Met Ala Leu Leu Cys Ser Pro Lys Leu Leu He Ala Asp 

Cys He Ala Met Ala Leu Leu Cys Ser Pro Lys Leu Leu lie Ala Asp 

170 175 180 



gaa cct acg act get tta gat gtt tct gtt cag tat cag att eta caa 

Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin Tyr Gin He Leu n n 

Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin Tyr Gin He Leu Gin 
185 190 295 

tta eta aaa aca eta cag aaa aaa acg gga atg age ctt ctt att at*- 

Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Met Ser Leu Leu He lie 

Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Met Ser Leu Leu He He 
200 205 210 

acc cat aat atg gga gtc gtt gca gaa act get gat gac gtg etc oto 

Thr His Asn Met Gly Val Val Ala Glu Thr Ala Asp Asp Val Leu Val 

Thr His Asn Met Gly Val Val Ala Glu Thr Ala Asp Asp Val Leu Val 
215 220 225 

f tC £ at ?? a 9 ? a CgC 3tg gta gaa tgt 9 CC cct gcg gtt caa atg ttc 
Leu Tyr Ala Gly Arg Met Val Glu Cys Ala Pro Ala Val Gin Me? Phe 
Leu Tyr Ala Gly Arg Met Val Glu Cys Ala Pro Ala Val Gin Met PhJ 
30 235 240 245 

cat aat cct tct cat ccc tat acc cga gat ctt tta gca tc~ aca e-~ 
His Asn Pro Ser His Pro Tyr Thr Arg Asp Leu Leu Ala Ser" Arg P-o 
His Asn Pro Ser His Pro Tyr Thr Arg Asp Leu Leu Ala Ser Ar? Pro 
250 255 260 

tct eta caa ccg caa caa eta ggt tee ttc aac ccc att cca gga eag 
Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn Pro He Pro Glv Gin 



He Pro Gly Gin 
275 



Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn P*-o 
265 270 

ccc cca cac tac acg gcc ttt ccc teg gga tgt cgc tat cac cct aaa 
Pro Pro His Tyr Thr Ala Phe Pro Ser Gly Cys Arg Tyr alt B~o A?a 

Pr ° ?«« Thr Ala PhS Pr ° Ser <* s His Pro A?I 

280 235 290 

tgc tea aaa att tta aat cga tgt tct gcg gaa get cca gaa ate tat 

Cys Ser Lys He Leu Asn Arg Cys Ser Ala Glu Ala Pro Glu He Tvr 

Cys Ser Lys He Leu Asn Arg Cys Ser Ala Glu Ala Pro Glu 11* T Jr 
295 300 

ccg gta cgc gaa ggt cac aaa gta agg gtt ggc tgt atg aca ac+- aaf 
Pro Val Arg Glu Gly His Lys Val Arg Val Gly Cys Met ?S ?^ Asn 
Pro Val Arg Glu Gly His Lys Val Arg Val Gly Cys Met Thr Sr Asn 



315 320 



325 



ttt ccc caa cct tt, 
Phe Pn 



Phe Pro Gin Pro Leu Ile-S HI ?£ S e = r Leu Sr tyl Ss ? y = 
Phe Pro Gin Pro Leu lie Gin Ala Thr Ser Lau Thr Lys Hif T?r Tvr 



335 — — IVo 
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aag cgt tec ttt tgg ttt cag gga aag aca att gec agt cgt cct gtt 1171 
Lys Arg Ser Phe Trp Phe Gin Gly Lys Thr lie Ala Ser Arg Pro Val 
Lvs A*-q Se 1 - Phe Trp Phe Gin Gly Lys Thr He Ala Ser Arg Pro Val 
345 350 355 

gac gac gtc tct ttt tea eta tac tec aga cgt get gtc gga ctt att 1219 

Asp Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg Ala Val Gly Leu He 

Asd Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg Ala Val Gly Leu lie 
360 365 370 

gga gaa tct gga tea ggg aaa agt ace ctg gcg tta get etc gca ggt 12 67 

Gly Glu Ser Gly Ser Gly Lys Ser Thr Leu Ala Leu Ala Leu Ala Gly 

Gly Glu Ser Gly Ser Gly Lys Ser Thr Leu Ala Leu Ala Leu Ala Gly 

375 380 385 

etc eta cct etc acc tct ggg ttc tta act ttt aac ggc acc cca ate 1315 

Leu Leu P-o Leu Thr Ser Gly Phe Leu Thr Phe Asn Gly Thr Pro lie 

Leu Leu Pro Leu Thr Ser Gly Phe Leu Thr Phe Asn Gly Thr Pro He 
390 395 400 405 

aag ttg cat tct aaa cac gga cgc cat caa tta cga tct caa gta egg 1363 

Lys Leu His Ser Lys His Gly Arg His Gin Leu Arg Ser Gin Val Arg 

Lvs Leu His Ser Lys His Gly Arg His Gin Leu Arg Ser Gin Val Arg 

* 410 415 420 

ttg gtc ttt caa aat cca caa get tea tta aac ceg cga aaa act ate 1411 

Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Arg Lys Thr He 

Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Arg Lys Thr He 

425 430 435 

eta gat agt tta ggc cac tct ctg ctt tac cat aaa etc gtc cca aaa 1459 

Leu Asp Ser Leu Gly His Ser Leu Leu Tyr His Lys Leu Val Pro Lys 

Leu Asd Ser Leu Gly His Ser Leu Leu Tyr His Lys Leu Val Pro Lys 
" 440 445 450 

gaa aaa gta eta gca acg gta agg gaa tat tta gaa ttg gta ggg tta 1507 

Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu Glu Leu Val Gly Leu 

Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu Glu Leu Val Gly Leu 

455 460 465 

tct gag gag tat ttt tat cgt tat cct cac cag ctt tct gga gga caa 1555 

Ser Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin Leu Ser Gly Gly Gin 

Se*- Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin Leu Ser Gly Gly Gin 
470 475 480 485 

caa caa cca gtc tct ata gcg aga gec eta tta gga gtc cct cag tta 1603 

Gin Gin Arg Val Ser lie Ala Arg Ala Leu Leu Gly Val Pro Gin Leu 

Gin Gin Arg Val Ser lie Ala Arg Ala Leu Leu Gly Val Pro Gin Leu 

490 495 500 

att att tgt gac gaa att gtt tct get eta gat tta tct att caa gca 1651 

He lie Cys Asp Glu lie Val Ser Ala Leu Asp Leu Ser lie Gin Ala 

r e He Cvs Asd Glu He Val Ser Ala Leu Asd Leu Ser He Gin Ala 

" 505 510 515 

caa att ctg aat atg ctt gec gag ctg caa aaa aaa etc age etc aca 1699 

Gin He Leu Asn Met Leu Ala Glu Leu Gin Lys Lys Leu Ser Leu Thr 

Gin lie Leu Asn Met Leu Ala Glu Leu Gin Lys Lys Leu Ser Leu Thr 

520 525 530 
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tat etc ttc att teg cat gat ctt gec gtt gta cgc teg ttc tgc aca 1747 

Tyr Leu Phe lie Ser His Asp Leu Ala Val Val Arg Ser Phe Cys Thr 

Tyr Leu Phe lie Ser His Asp Leu Ala Val Val Arg Ser Phe Cys Thr 

535 540 545 

gag gta ttc att atg tat aag ggg caa att gta gaa aaa gga aat aca 1795 
Glu Val Phe lie Met Tyr Lys Gly Gin lie Val Glu Lys Gly Asn Thr 
Glu Val Phe lie Met Tyr Lys Gly Gin lie Val Glu Lys Gly Asn Thr 
550 555 560 565 

aaa cgc att ttt tct gat cca caa cat cct tat acg cgc atg ttg tta 1843 

Lys Arg lie Phe Ser Asp Pro Gin His Pro Tyr Thr Arg Met Leu Leu 

Lys Arg lie Phe Ser Asp Pro Gin His Pro Tyr Thr Arg Met Leu Leu 

570 575 580 

aat gec caa ctt cca gag act cct gat caa agg caa tct aaa cct ata 1891 

Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg Gin Ser Lys Pro lie 

Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg Gin 

585 590 595 

ttc caa gaa tat cac aaa gat tct gaa gaa tct tgc tct aca gga tgc 1939 
Phe Gin Glu Tyr His Lys Asp Ser Glu Glu Ser Cys Ser Thr Glv Cv. 

600 605 610 y 

tac ttt tac aat cgt tgt cca caa aaa caa gaa get tgc aag tea aaa 1 QH7 
Tyr Phe Tyr Asn Arg Cys Pro Gin Lys Gin Glu Ala Cys Lys Ser Glu 
615 620 525 

ate ate cca aat caa gga gac gcg cac cat aca tac cgt tgt ate cat 2035 
He He Pro Asn Gin Gly Asp Ala His His Thr Tyr Arg Cys He His 
630 635 640 645 

tgattegtec tetaegctat tcttaagcta ccattaagga atcccaaggg agaggtctgc 20 95 

2100 
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Figure 14(RY-40) 

Restriction enzyme analysis of CPN 100515 

CviJI 
Hael 
Haelll 

MSCl 

Eael I CviRI CviJI Mwol 

II III 
AAGGAGCAAATGGAGATTGGCCAAATAGACGAGCAAGGGTTTGCATAAGAATAGCCTTTT 

x + + + + + + 

TTCCTCGTTTACCTCTAACCGGTTTATCTGCTCGTTCCCAAACGTATTCTTATCGGAAAA 

Dpnl 
Cjel| 

Sau3Al| | CviJI Cjel 

III I I 

TCGCAATAATAACTTGCCTAAACGATCTTGTAAACGACTTATGGCTTCTAATCCCATTTT 

61 + + + + + + 

AGCGTTATTATTGAACGGATTTGCTAGAACATTTGCTGAATACCGAAGATTAGGGTAAAA 



Dpnl 
Hin4I 
BstYI 
Sau3AI 
HaelV | 
Mnll Hin4I | 
I I I 



BstXI 
BsaJI | 
Alwl Styl j 

I I I 



Bed 
CjePI | 
Rsal | j 
Taal | | | 
III 



121 



ACAGATAGAGGATCTATCCATAACCTTGGCAAAACAACGCCAACAGTACCCCATCGTCCA 

TGTCTATCTCCTAGATAGGTATTGGAACCGTTTTGTTGCGGTTGTCATGGGGTAGCAGGT 

Dpnl 
Sau3AI 
Hpyl78III | 
BpulOI Hinf I | j 

CjePI Ddel Tfil | j 

II I M 

ATCTTTATCGTTTACTATCAATGAAGGACAAACCTTAGCAATCATTGGAGAATCAGGATC 

i + + + + h 

TAGAAATAGCAAATGATAGTTACTTCCTGTTTGGAATCGTTAGTAACCTCTTAGTCCTAG 



Hpyl78III 
I 



CviRI 
Nlalll 
Nspl 
SphI 
Cac8I 
Hhal | 
Fsplj | 
BsmAI | | | 
I II I 



AGGAAAATCTGTCTCTGCGCATGCAATCCTTCGATTACTTCCTTGCCCCCCATTTTCTGT 

241 + + + + + + 300 

TCCTTTTAGACAGAGACGCGTACGTTAGGAAGCTAATGAAGGAACGGGGGGTAAAAGACA 
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CviJI 
Hael 
Haelll 
BsaJI 
Cjel 
Hindi 



ScrFI 
CviJI 
EcoRII 
Hael 
Haelll 
MscI 
Eael | 
I I 



Styl 
I 



Hhal 

CviJI Thai | Beef I 
III I 



TTCTGGCCAGGTCAACTTCCAAGGCCACAACTTACTTACGGCTTCGCGCTCTATACAAAA 
AAGACCGGTCCAGTTGAAGGTTCCGGTGTTGAATGAATGCCGAAGCGCGAGATATGTTTT 



Apol 
Tsp509I 
I 



Dpnl 
Sau3AI j 
BsmFI | | 

I I I 



Faul 
Sthl32I | 
Cac8I | j 
Acil | | | 
I I II 



BscGI 
Tthlllll 
SfaNI | 
I 



AAAG ATT AT AGGG AC AG AAAT TTCTATGATCTTT C AAAAC C CG C AAG CAT CTC TAAAC C C 



TTTCTAATATCCCTGTCTTTAAAGATACTAGAAAGTTTTGGGCGTTCGTAGAGATTTGGG 

CviRI 
Sfd 
Mwol 
Msel | 
CviJI | | 
Bfal | | | 
I I I 



XmnI 
Tsp509I | 
Hpyl78III | | 
Taql | | | 

Mil 



CGTGTTTACTATTGAACAGCAGTTTCGAGAAATTATTCATACCCACCTAGCCTTAACTGC 



GCACAAATGATAACTTGTCGTCAAAGCTCTTTAATAAGTATGGGTGGATCGGAATTGACG 



ScrFI 
BsaJI 
EcoRII 
Tthlllll 

Dpnl 
Nlalll ' 
Sau3AI 
Hpyl78III 

BstAPI Real j 

Mwol Alwl | | 

PstI I Hpyl78III MboII | | 

II I IM 

AGAAGTTGCTAAAGAAAAGATGTTATACGCTCTTGAAGAAACAGGGTTTCATGATCCCAG 



TCTTCAACGATTTCTTTTCTACAATATGCGAGAACTTCTTTGTCCCAAAGTACTAGGGTC 
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CjePI 
Hinfl | 

Tf il | 

I I 



Hpyl78III 

BstXI 
SfaNI | 
Mnll | j 
Xcml | | 
II ■ 



BsrDI 
CviRI 
Bpml 
Fokl 
Apol | 
Tsp509I | 
CjePI || 
I I 



GCTGTGCTTGAATCTCTACCCCCACCAACTCTCTGGAGGGATGCTTCAAAGAATTTGCAT 
CGACACGAACTTAGAGATGGGGGTGGTTGAGAGACCTCCCTACGAAGTTTCTTAAACGTA 



Haell 
Hhal 
Nlalll 
BsaJI 
BstDSI 
Ncol 
Styl 
BseRI | 

MslI I 111 Mnll Pf 111081 

II I II I I 

TGCCATGGCGCTCCTCTGTTCTCCTAAACTTCTTATTGCTGATGAACCTACGACTGCTTT 

01 + + + + + + 

ACGGTACCGCGAGGAGACAAGAGGATTTGAAGAATAACGACTACTTGGATGCTGACGAAA 

Hinfl 

Tfil Sthl32I 
Hpyl88IX |Tsp509I Sfcl | BscGI 

III II I 

AGATGTTTCTGTTCAGTATCAGATTCTACAATTACTAAAAACACTACAGAAAAAAACGGG 

61 + + + + + + 

T CTAC AAAGAC AAGT CAT AG TCTAAG ATGTT AATG AT TTTTG' 



BslI 
Pf 1MI 
I 



AlwNI 
BstAPI 
CviRI j 
Plel Mwol 



Maell 
I 



AATGAGCCTTCTTATTATTACCCATAATATGGGAGTCGTTGCAGAAACTGCTGATGACGT 
TTACTCGGAAGAATAATAATGGGTATTATACCCTCAGCAACGTCTTTGACGACTACTGCA 



BsiHKAI 
Bspl286I 

BsiHKAI | Bspl286I 

Bspl286I j Hgal Bmgl | 

BssSl| j CviRI Nlalll | BseSI |AciI CjePI Fokl 

II I I II 11 I I I 
GCTCGTGCTCTATGCAGGACGCATGGTAGAATGTGCCCCTGCGGTTCAAATGTTCCATAA 



781 



CGAGCACGAGATACGTCCTGCGTACCATCTTACACGGGGACGCCAAGTTTACAAGGTATT 
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CjePI 
Sthl32I 
Dpnl 
Bglll 
BstYI 
Sau3AI 
CjePI | 
Fokl | j 
Aval | | | 
III I 



I I I 



Acil 
Mnll j 
I I 



TCCTTCTCATCCCTATACCCGAGATCTTTTAGCATCCAGACCCTCTCTACAACCGCAACA 
AGGAAGAGTAGGGATATGGGCTCTAGAAAATCGTAGGTCTGGGAGAGATGTTGGCGTTGT 



Beef I 
Hpyl78III 

Aval 
BsaJI 



Bf al 
CjePI |NlaIV 

I I I 



ScrFI 
BslI | 
EcoRII | j 
I I I 



CviJI 
I 



RleAI 
Sthl32I | 
CviJI | j 
HaeIIl| | 



ACTAGGTTCCTTCAACCCCATTCCAGGACAGCCCCCACACTACACGGCCTTTCCCTCGGG 

+ + + + H + 

TGATCCAAGGAAGTTGGGGTAAGGTCCTGTCGGGGGTGTGATGTGCCGGAAAGGGAGCCC 



SfaNI 
Mnll | Bfal 
HphI | j Fokl | 
III'' 



Bpml 
Clal I 
Apol Dral | j 
Tsp509I MseljTaql j 
I I i II 



Hpyl78III 
Alul | 
Acil CviJI j 
i I 



ATGTCGCTATCACCCTAGATGCTCAAAAATTTTAAATCGATGTTCTGCGGAAGCTCCAGA 

TACAGCGATAGTGGGATCTACGAGTTTTTAAAATTTAGCTACAAGACGCCTTCGAGGTCT 

Maelll 
Tsp45I 
Mspl Thai | 
BsaWl|RsaI | | CviJI Tsp509I 

II I I I I I 

AATCTATCCGGTACGCGAAGGTCACAAAGTAAGGGTTGGCTGTATGACGACTAATTTTCC 

TTAGATAGGCCATGCGCTTCCAGTGTTTCATTCCCAACCGACATACTGCTGATTAAAAGG 

Mnll 

Tsp509I Tthlllll| 
Msel | Msel | | 

II I M 

CCAACCTTTAATTCAAGCAACCTCATTAACAAAGCACTATTACAAGCGTTCCTTTTGGTT 

GGTTGGAAATTAAGTTCGTTGGAGTAATTGTTTCGTGATAATGTTCGCAAGGAAAACCAA 
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Muni 
Tsp509I 
I 



BsmAI 
BsmBI 
Bpml 
Aatll 
BsaHI 
Hin4I 
Mae 1 1 
Tthllll | 
Hindi | | 
I I I 



TCAGGGAAAGACAATTGCCAGTCGTCCTGTTGACGACGTCTCTTTTTCACTATACTCCAG 

+ + |- + + + 

AGTCCCTTTCTGTTAACGGTCAGCAGGACAACTGCTGCAGAGAAAAAGTGATATGAGGTC 



Hpyl8 8IX 
Ahdl 
Haeiv| 
Hin4l | 
Maell | | 
I I 



Dpnl 
Sau3AI 
Hpyl78III | 
Hinf I | j 

Tfil | | 

I I I 



ScrFI 
BsaJI | 
EcoRII j 
Rsal | | 

I I 



Alul 
CviJI 
BspMI | 
I i 



1201 



ACGTGCTGTCGGACTTATTGGAGAATCTGGATCAGGGAAAAGTACCCTGGCGTTAGCTCT 
TGCACGACAGCCTGAATAACCTCTTAGACCTAGTCCCTTTTCATGGGACCGCAATCGAGA 



Bsal 

BsmAI Msel NlalV 

Hphll Mnll Mnll | Msel BanI | Beef I 

II I M I I I I 

CGCAGGTCTCCTACCTCTCACCTCTGGGTTCTTAACTTTTAACGGCACCCCAATCAAGTT 

GCGTCCAGAGGATGGAGAGTGGAGACCCAAGAATTGAAAATTGCCGTGGGGTTAGTTCAA 



BsmI 
CviRI 
I 



Smll 
Dpnl 
Sau3AI 
Hgal | 
Tsp509l| j 
Bed | 1 | 
Bce83I IN | 
BsaHI | j II I 
I I II 



Taal 
Rsal j 

I 



GCATTCTAAACACGGACGCCATCAATTACGATCTCAAGTACGGTTGGTCTTTCAAAATCC 

1321 + + + + + + 138° 

CGTAAGATTTGTGCCTGCGGTAGTTAATGCTAGAGTTCATGCCAACCAGAAAGTTTTAGG 



Alul 
CviJI 
Hindu I j 
I I 



Faul 
Sthl32I | 
Thai I 



Msel Acil 



I I 



CviJI 
Hael 
Haelll 
I 



ACAAGCTTCATTAAACCCGCGAAAAACTATCCTAGATAGTTTAGGCCACTCTCTGCTTTA 
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Bf al 
Rsal | 
Seal j 
Tat I | j 

I I I 



Taal 
I 



Tsp509I 
Sspl | BseMII 

I I 



CCATAAACTCGTCCCAAAAGAAAAAGTACTAGCAACGGTAAGGGAATATTTAGAATTGGT 

1441 + + + + + + 

GGTATTTGAGCAGGGTTTTCTTTTTCATGATCGTTGCCATTCCCTTATAAATCTTAACCA 

Hpyl78III 
BstXI 
Mnll 

Ddel Mnll | 

Hpyl88IX BseRI Alul j | 

Mnll | HphI CviJlj j 

II I Ml 

AGGGTTATCTGAGGAGTATTTTTATCGTTATCCTCACCAGCTTTCTGGAGGACAACAACA 

1501 + + + + + + 1560 

TCCCAATAGACTCCTCATAAAAATAGCAATAGGAGTGGTCGAAAGACCTCCTGTTGTTGT 

BseMII 
Maelll 
Tsp4 5I 
Mnll 



Plel 
BsmAI 
Sfcl | 
Bpml j j 
Hinf I | I | 
I I I I 



Banll 
Bspl286I 
CviJI | 
BsmFI | j 
I I 



Tsp509I 
Msel | 
Plel | j 
Hinfl Ddel | j j 
I MM 



Tsp509I 
I 



ACGAGTCTCTATAGCGAGAGCCCTATTAGGAGTCCCTCAGTTAATTATTTGTGACGAAAT 
TGCTCAGAGATATCGCTCTCGGGATAATCCTCAGGGAGTCAATTAATAAACACTGCTTTA 



Hpyl7 8III 
Bfal | 
Xbal I j 

III 



Bbvl 
Tthlllll 
Acelll 
Hpyl8 8IX 
Tthlllll j 
Apol | | 
Tsp509I | | 
I i ■ 



CviRI 
Fnu4HI 
Alul | 
CviJI j 
Tsel | 
Mwol | j 
Cac8I | | 1 
I III 



TGTTTCTGCTCTAGATTTATCTATTCAAGCACAAATTCTGAATATGCTTGCCGAGCTGCA 

1621 + + + + + + 1680 

ACAAAGACGAGATCTAAATAGATAAGTTCGTGTTTAAGACTTATACGAACGGCTCGACGT 



MboII Dpnl 
CviJI | Mnll Beef I NlaIIl| 
Ddel | j BseMII | Earl | Sau3AI | | Bsgl Rsal Mwol 

I I I II II III II I 

AAAAAAACTCAGCCTCACATATCTCTTCATTTCGCATGATCTTGCCGTTGTACGCTCGTT 

1681 + + + + + + 1740 

TTTTTTTGAGTCGGAGTGTATAGAGAAGTAAAGCGTACTAGAACGGCAACATGCGAGCAA 
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CviRI 

Mill I | Tsp509I 
I I I 

CTGCACAGAGGTATTCATTATGTATAAGGGGCAAATTGTAGAAAAAGGAAATACAAAACG 

1741 + + + + + + 1800 

GACGTGTCTCCATAAGTAATACATATTCCCCGTTTAACATCTTTTTCCTTTATGTTTTGC 



Dpnl 
Sau3AI 
Hpyl8 8IX | 
Fokl i j 
Alwl | j j 



Nlalll 
Hhal |MseI 
Thai |NspI | 
I I I I 



Hpyl78III 
Plel | 
BsmAI | | 

I I 



CATTTTTTCTGATCCACAACATCCTTATACGCGCATGTTGTTAAATGCCCAACTTCCAGA 

1801 + + + + + + 1860 

GTAAAAAAGACTAGGTGTTGTAGGAATATGCGCGTACAACAATTTACGGGTTGAAGGTCT 



Dpnl 
Bell 
Sau3AI 
Hpyl78III | 

HaeIV| j | H P yl88IX 
Hin4l|| | Hinfl |HinfI 

Hinf I Ml | Tfil | Tf il 

I Ml I III 

GACTCCTGATCAAAGGCAATCTAAACCTATATTCCAAGAATATCACAAAGATTCTGAAGA 

1861 + + + + + + 

CTGAGGACTAGTTTCCGTTAGATTTGGATATAAGGTTCTTATAGTGTTTCTAAGACTTCT 



CviRI 
Fokl | 
Cac8I | 

Sfcl Alul | j 

MboII | CviJI j | 

SfaNI | Eco57I Fokl Hindi I I | || 

I II I I I I II 
ATCTTGCTCTACAGGATGCTACTTTTACAATCGTTGTCCACAAAAACAAGAAGCTTGCAA 
1 Q->1 J J- * + + + 1980 



Dpnl BslI Bael BciVI 

Sau3AI | BsmAI | Hhal Hinfl | 

Hpyl88IX | | BsmBI | Thai jHgal Taal Tfil | 

III II II I I I I 

GTCAGAGATCATCCCAAATCAAGGAGACGCGCACCATACATACCGTTGTATCCATTGATT 

1981 + + + + + + 2040 

CAGTCTCTAGTAGGGTTTAGTTCCTCTGCGCGTGGTATGTATGGCAACATAGGTAACTAA 
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Bael 
Hin4I | 
I I 



Alul 
CviJI 
Msel | 
Aflll| I 
Smll | j 
Mnll | I I 
I I 



Mnll 
BsaJI | 
Hinf I | | 
Msel Tfil Stylj 
I I 



CGTCCTCTACGCTATTCTTAAGCTACCATTAAGGAATCCCAAGGGAGAGGTCTGCTCTAT 
GCAGGAGATGCGATAAGAATTCGATGGTAATTCCTTAGGGTTCCCTCTCCAGACGAGATA 
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cgaagagcaa acctccacag ttacagagaa agacgtccaa cctaaaacac aagcaacacc 60 

acacgcttcg aagaaaaacg ttgcaagtcc ttcgacctct atg cca gga ate gag 115 

Met Pro Gly lie Glu 
1 5 

aaa gca gca aca aca gtg get gta cct caa gac aaa tct gaa gaa gaa 163 
Lys Ala Ala Thr Thr Val Ala Val Pro Gin Asp Lys Ser Glu Glu Glu 
10 15 20 

aaa gtt aaa gag cga ttg aca aag egg gaa ctt ace tgt gaa gac ctt 211 
Lys Val Lys Glu Arg Leu Thr Lys Arg Glu Leu Thr Cys Glu Asp Leu 
25 30 35 

aaa gat aac ggc tat act gtc aat ttt gaa gac att tct att tta gag 259 
Lys Asp Asn Gly Tyr Thr Val Asn Phe Glu Asp lie Ser lie Leu Glu 
40 45 50 

ttg ttg cag ttc gta agt aaa att tct gga acg aac ttt gtc ttt gat 3 07 
Leu Leu Gin Phe Val Ser Lys He Ser Gly Thr Asn Phe Val Phe Asp 
55 60 65 

age aac gat ttg caa ttc aat gtc acg ate gtt tec cac gat cct act 3 55 
Ser Asn Asp Leu Gin Phe Asn Val Thr He Val Ser His Asp Pro Thr 
70 75 80 85 

tct gta gat gat tta tct aca ate tta eta caa gtc tta aaa atg cat 403 
Ser Val Asp Asp Leu Ser Thr He Leu Leu Gin Val Leu Lys Met His 
90 95 100 

gac ttg aag gtt gtt gaa caa ggc aat aac gtc ctt ate tat cgt aat 451 
Asp Leu Lys Val Val Glu Gin Gly Asn Asn Val Leu He Tyr Arg Asn 
105 HO 115 

cct cat ctt tct aag eta tec aca gta gtc aca gac age tec tta aaa 499 
Pro His Leu Ser Lys Leu Ser Thr Val Val Thr Asp Ser Ser Leu Lys 
120 125 130 

gaa acg tgt gaa get gtt gtg gtt acc cga gtg ttc cgt ctt tac agg 547 
Glu Thr Cys Glu Ala Val Val Val Thr Arg Val Phe Arg Leu Tyr Arg 
135 140 145 

cgt cag ccc tct gca gca gta aat att att caa cct tta ctt tec cat 595 
Arg Gin Pro Ser Ala Ala Val Asn He lie Gin Pro Leu Leu Ser His 
150 155 160 165 

gat get ate gtt agt get tea gaa get act cgt cat gtt ate ate teg 643 
Asp Ala He Val Ser Ala Ser Glu Ala Thr Arg His Val He He Ser 
170 175 180 

gat att get ggt aat gtc gat aaa gtc agt gat ttg eta gca get eta 691 
Asp He Ala Gly Asn Val Asp Lys Val Ser Asp Leu Leu Ala Ala Leu 
185 190 195 
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gat tgc cca ggc aca tct gtg gac atg act gaa tac gaa gtt aaa tat 

Asp Cys Pro Gly Thr Ser Val Asp Met Thr Glu Tyr Glu Val Lys Tyr 

200 205 210 

gcc aat ccc gca get ctt gtt age tac tgc caa gat gtt ctt ggt act 

Ala Asn Pro Ala Ala Leu Val Ser Tyr Cys Gin Asp Val Leu Gly Thr 

215 220 225 

ctg gcc gaa gat gat get ttc caa atg ttc ate caa cct gga acg aac 

Leu Ala Glu Asp Asp Ala Phe Gin Met Phe lie Gin Pro Gly Thr Asn 

230 235 240 245 

aaa att ttc gtc gtc tct tea cca cgt ctt gca aat aag gca gag cag 

Lys lie Phe Val Val Ser Ser Pro Arg Leu Ala Asn Lys Ala Glu Gin 

250 255 260 

etc ctg aag tec tta gat gtc cca gaa atg gca cat acc eta gat gat 

Leu Leu Lys Ser Leu Asp Val Pro Glu Met Ala His Thr Leu Asp Asp 

265 270 275 

cct gca agt act gcc ttg get ttg gga gga aca gga acc acg age cct 

Pro Ala Ser Thr Ala Leu Ala Leu Gly Gly Thr Gly Thr Thr Ser Pro 

280 285 290 

aag agt ttg egg ttc ttt atg tac aag ctg aag tat caa aat gga gaa 

Lys Ser Leu Arg Phe Phe Met Tyr Lys Leu Lys Tyr Gin Asn Gly Glu 

295 300 305 

gtg att get aat gcc etc caa gat ate ggt tac aat eta tat gta acc 

Val lie Ala Asn Ala Leu Gin Asp lie Gly Tyr Asn Leu Tyr Val Thr 

310 315 320 325 

aca get atg gac gaa gat ttc att aac act etc aat agt ate cag tgg 

Thr Ala Met Asp Glu Asp Phe lie Asn Thr Leu Asn Ser lie Gin Trp 

330 335 340 

tta gag gtc aat aac tec ata gtt att ate gga aac caa ggg aat gtc 

Leu Glu Val Asn Asn Ser lie Val lie lie Gly Asn Gin Gly Asn Val 

345 350 355 

gac aga gtt att ggc etc tta aac ggt tta gat tta cct cct aaa cag 

Asp Arg Val lie Gly Leu Leu Asn Gly Leu Asp Leu Pro Pro Lys Gin 

360 365 370 

gtt tac ate gaa gtt tta att eta gat acc age tta gag aaa tec tgg 

Val Tyr lie Glu Val Leu lie Leu Asp Thr Ser Leu Glu Lys Ser Trp 

375 380 385 

gac ttt gga gtg caa tgg gta gcc eta ggt gat gaa caa agt aaa gta 

Asp Phe Gly Val Gin Trp Val Ala Leu Gly Asp Glu Gin Ser Lys Val 

390 395 400 405 
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Fig. 15 con't) 

get tat get tct gga eta ttg aat aat act ggc ata gec aca cct aca 

Ala Tyr Ala Ser Gly Leu Leu Asn Asn Thr Gly lie Ala Thr Pro Thr 

410 415 420 

aaa gca act gtc cct ccc ggc acg cca aat cct ggt teg ate cct ctt 

Lys Ala Thr Val Pro Pro Gly Thr Pro Asn Pro Gly Ser lie Pro Leu 

425 430 435 

cct acg cca gga caa ttg aca ggg ttc tea gat atg ctg aac tct teg 

Pro Thr Pro Gly Gin Leu Thr Gly Phe Ser Asp Met Leu Asn Ser Ser 

440 445 450 

tea gca ttc ggt eta gga ate ate gga aat gtc eta agt cat aaa ggg 

Ser Ala Phe Gly Leu Gly lie lie Gly Asn Val Leu Ser His Lys Gly 

455 460 465 

aag tct ttc ctt act ttg gga ggc tta tta agt gee tta gat caa gat 

Lys Ser Phe Leu Thr Leu Gly Gly Leu Leu Ser Ala Leu Asp Gin Asp 

470 475 480 485 

gga gat act gtc att gtc ttg aat cct aga ate atg get cag gat acg 

Gly Asp Thr Val lie Val Leu Asn Pro Arg lie Met Ala Gin Asp Thr 

490 495 500 

caa caa get teg ttt ttt gta ggg caa acg gtc cct tac caa act ate 

Gin Gin Ala Ser Phe Phe Val Gly Gin Thr Val Pro Tyr Gin Thr lie 

505 510 515 

aaa tac tat ate caa gaa aca gga act gta acg caa aat ate gat tat 

Lys Tyr Tyr lie Gin Glu Thr Gly Thr Val Thr Gin Asn lie Asp Tyr 

520 525 530 

gaa gat att gga gtg aac ctt gtc gtt acc tct aca gtt get ccc aac 

Glu Asp He Gly Val Asn Leu Val Val Thr Ser Thr Val Ala Pro Asn 

535 540 545 

aat gta gtt aca eta caa ate gaa cag acg ate tea gaa tta cat tec 

Asn Val Val Thr Leu Gin He Glu Gin Thr He Ser Glu Leu His Ser 

550 555 560 565 

gcg tct gga tea eta aca cct gtc aca gat aaa act tat gca gee aca 

Ala Ser Gly Ser Leu Thr Pro Val Thr Asp Lys Thr Tyr Ala Ala Thr 

570 575 580 

cgc tta caa att ccc gac ggt tgt ttc tta gtt atg agt ggg cat ate 

Arg Leu Gin He Pro Asp Gly Cys Phe Leu Val Met Ser Gly His He 

585 590 595 

aga gat aaa act aca aaa gtg gtt tea gga gtg cct ttg eta aac tec 

Arg Asp Lys Thr Thr Lys Val Val Ser Gly Val Pro Leu Leu Asn Ser 

600 605 610 
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ata cca tta att cgt ggt tta ttt age cgt acc ate gac caa agg caa 1987 
lie Pro Leu lie Arg Gly Leu Phe Ser Arg Thr lie Asp Gin Arg Gin 
615 620 625 

aaa cgc aat ate atg atg ttt att aag cct aag gtg att agt age ttt 2035 
Lys Arg Asn lie Met Met Phe lie Lys Pro Lys Val lie Ser Ser Phe 
630 635 640 645 

gaa gaa ggc act cgt gtt acc aat aag gaa gga tac aga tac aat tgg 2083 
Glu Glu Gly Thr Arg Val Thr Asn Lys Glu Gly Tyr Arg Tyr Asn Trp 
650 655 660 

gaa get gat gaa gga tec atg caa gtg gee cct cgc cat get cct gaa 2131 
Glu Ala Asp Glu Gly Ser Met Gin Val Ala Pro Arg His Ala Pro Glu 
665 670 675 

tgc caa gga cct cct tct tta cag get gaa agt gac ttt aaa ata ata 2179 
Cys Gin Gly Pro Pro Ser Leu Gin Ala Glu Ser Asp Phe Lys lie lie 
680 685 690 

gaa ata gaa get cag tagtggtata taaaagagga agatgatatt ctccgccgtg 2234 
Glu lie Glu Ala Gin 
695 

gaatagcttc tgactctgtt gcattcaggg ggaaagccaa gaagatgtag agtcggccgt 22 94 
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Figure 16(RY-41) 

Restriction enzyme analysis of CPN100538 

Mnll Aatll 
Maelll | BsaHl | 

MboII | I Mae I I I 

Hin4I | Taal | BplI | | Bsbl 

I I I I III I 

CGAAGAGCAAACCTCCACAGTTACAGAGAAAGACGTCCAACCTAAAACACAAGCAACACC 

! + + + + + + ' 

GCTTCTCGTTTGGAGGTGTCAATGTCTCTTTCTGCAGGTTGGATTTTGTGTTCGTTGTGG 



NspV 
TaqI 
Tthlllll | 
Mmel 
I 



|BcgI Maell 
III I 



CviRI 
Bcgl| 
MboII | 
Acll | | 
I I 
I I 



TaqI 

I 



Fnu4HI 
Tsel | 
Bcgl 
Hpyl78III 
TaqI 
Hinf I 



Tfi 
Bcgl 
Mnll 
ScrFI | 
EcoRII | I 
I II 



61 



ACACGCTTCGAAGAAAAACGTTGCAAGTCCTTCGACCTCTATGCCAGGAATCGAGAAAGC 
TGTGCGAAGCTTCTTTTTGCAACGTTCAGGAAGCTGGAGATACGGTCCTTAGCTCTTTCG 



Hpyl78III 
Smll | 
Rsal | j 



TspRI | j | FauI 

AlwNl| j | | Sthl32l| 

CviJI j I I | Eco57I | | 

Bbvl | | | | I MboII | | | 

Taal | III II Hpyl88IX Msel | | M 

Bce83I || II | | I Mnll | MboII | j j | ! 

I II II I I I II IM I M 

AGCAACAACAGTGGCTGTACCTCAAGACAAATCTGAAGAAGAAAAAGTTAAAGAGCGATT 

121 + + + + + + 180 

TCGTTGTTGTCACCGACATGGAGTTCTGTTTAGACTTCTTCTTTTTCAATTTCTCGCTAA 



MboII Bcefl 
Bbsl | Tsp509I | 

Acil Msel j CviJI Taal j j 

I I I I I I I 

GACAAAGCGGGAACTTACCTGTGAAGACCTTAAAGATAACGGCTATACTGTCAATTTTGA 

1 81 + + + + + + 240 

CTGTTTCGCCCTTGAATGGACACTTCTGGAATTTCTATTGCCGATATGACAGTTAAAACT 
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Fig. 16 (con't) 



Hpyl78III 

MboII Apol | 

Bbsl | CviRI Tsp509I j 

II I II 

AGACATTTCTATTTTAGAGTTGTTGCAGTTCGTAAGTAAAATTTCTGGAACGAACTTTGT 

241 + + + + + + 300 

TCTGTAAAGATAAAATCTCAACAACGTCAAGCATTCATTTTAAAGACCTTGCTTGAAACA 



BsiEI 
Pvul 
Dpnl | 
Sau3AI | j 
Hpyl7 8III | | | Dpnl 
Tsp509I Maelll | j j j Sau3AI | 

CviRI | Tsp45I || || Alwl | j Sfcl 

II I II II I I I I 

CTTTGATAGCAACGATTTGCAATTCAATGTCACGATCGTTTCCCACGATCCTACTTCTGT 

301 + + + + + + 360 

GAAACTATCGTTGCTAAACGTTAAGTTACAGTGCTAGCAAAGGGTGCTAGGATGAAGACA 



Nlalll 

HaelV Nsil | 

Hin4I Msel CviRI | j 

I I I I I 

AGATGATTTATCTACAATCTTACTACAAGTCTTAAAAATGCATGACTTGAAGGTTGTTGA 

3S1 + + + + + + 420 

TCTACTAAATAGATGTTAGAATGATGTTCAGAATTTTTACGTACTGAACTTCCAACAACT 



Alul 

CviJI Maelll 

Ddel | Tsp4 5I 

Maell Mnll j Taal | 

I I I I I 

ACAAGGCAATAACGTCCTTATCTATCGTAATCCTCATCTTTCTAAGCTATCCACAGTAGT 

TGTTCCGTTATTGCAGGAATAGATAGCATTAGGAGTAGAAAGATTCGATAGGTGTCATCA 



Af 1III 
Mae 1 1 

Alul Acelll | Alul BstEII Sthl32I 

CviJI Msel | | CviJI Maelll Aval | Hgal 

I I I I I II II 

CACAGACAGCTCCTTAAAAGAAACGTGTGAAGCTGTTGTGGTTACCCGAGTGTTCCGTCT 



GTGTCTGTCGAGGAATTTTCTTTGCACACTTCGACAACACCAATGGGCTCACAAGGCAGA 
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Fig. 1 6 (con't) 



Mnll 
PstI 
Fnu4HI 
CviRI | 

Tsel | | | Bbvl Eco57I 
BsaHI CviJI Sfcl ||| j Sspl | SfaNI Nlalll | 

I I I Ml I II I II 

TTACAGGCGTCAGCCCTCTGCAGCAGTAAATATTATTCAACCTTTACTTTCCCATGATGC 

541 + + + + + + 600 

AATGTCCGCAGTCGGGAGACGTCGTCATTTATAATAAGTTGGAAATGAAAGGGTACTACG 



Alul 
CviJI 
Mwol | 

Hpyl88IX | j Nlalll Hpyl88IX TaqI 

III I I I 

TATCGTTAGTGCTTCAGAAGCTACTCGTCATGTTATCATCTCGGATATTGCTGGTAATGT 

601 + + + + + + 660 

ATAGCAATCACGAAGTCTTCGATGAGCAGTACAATAGTAGAGCCTATAACGACCATTACA 



Hpyl7 8III 
Bf al | 
Xbal | 
Alul 
CviJI 
Fnu4HI | 

ScrFI 

Cac8I || | Ml BsaJI | 
Bf al | || j jjj EcoRII j 
Nhel || || j jj j Acelll | j 
TspRI Ml jj j III Bbvl | j j MslI Nlalll 

I III II I I I I III I I I 

CGATAAAGTCAGTGATTTGCTAGCAGCTCTAGATTGCCCAGGCACATCTGTGGACATGAC 

661 + + + + + + 720 

GCTATTTCAGTCACTAAACGATCGTCGAGATCTAACGGGTCCGTGTAGACACCTGTACTG 



Faul 
Sthl32I | 
Alul | j 
CviJI | j 
Fnu4HI | | j Acelll 
Tsel | jjj Alul 
Acil || |j| CviJI 
Msel Mwol | || jj! Bbvl | Xcml 

I I II II I I II I 

TGAATACGAAGTTAAATATGCCAATCCCGCAGCTCTTGTTAGCTACTGCCAAGATGTTCT 

721 + + + + + + 780 

ACTTATGCTTCAATTTATACGGTTAGGGCGTCGAGAACAATCGATGACGGTTCTACAAGA 
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CviJI 
Haelll 
SfaNI | 
Rsal Eael | | 
Bcgl | Gdill | | 
I I M 



MboII 
Fokl | 



ScrFI 
Bcgl EcoRII | 
I I I 



Apol 
Tsp509I 
I 



TGGTACTCTGGCCGAAGATGATGCTTTCCAAATGTTCATCCAACCTGGAACGAACAAAAT 



ACCATGAGACCGGCTTCTACTACGAAAGGTTTACAAGTAGGTTGGACCTTGCTTGTTTTA 



HphI 
Mmel | 
MboII | | 
I I I 



Maell 
Earl | 
BsmAI | | 
BsmBI j | 

I I I 



BsmFI 

Hpyl78III 
AlwNI ' 
Alul 
CviJI 
Fnu4HI | 
Tsel| j 
II I 



Ddel 
Acelll | 
Bbvl | | 

I I I 



CviRI Mwol 

I I 

TTTCGTCGTCTCTTCACCACGTCTTGCAAATAAGGCAGAGCAGCTCCTGAAGTCCTTAGA 

841 + + + + + + 900 

AAAGCAGCAGAGAAGTGGTGCAGAACGTTTATTCCGTCTCGTCGAGGACTTCAGGAATCT 



BsaJI 
BstAPI 
Mwol 

Dpnl Rsal | 

Sau3AI j Seal j 

BslI Alwl | j TatI | j | CviJI 

ECOS7I | Bfal | | CviRI | j | Sty I Mnll 

II I II I I II I ! 

TGTCCCAGAAATGGCACATACCCTAGATGATCCTGCAAGTACTGCCTTGGCTTTGGGAGG 

901 + + + + + + 960 

ACAGGGTCTTTACCGTGTATGGGATCTACTAGGACGTTCATGACGGAACCGAAACCCTCC 



Ddel 
Banll 
Bspl286I 

BssSI | | Rsal 

Drdll | j | BsrGI | Alul 

NlaIV| | CviJI || Acil TatI j CviJI 

II I I M I Ml 

AACAGGAACCACGAGCCCTAAGAGTTTGCGGTTCTTTATGTACAAGCTGAAGTATCAAAA 

961 + + + + + + 1020 

TTGTCCTTGGTGCTCGGGATTCTCAAACGCCAAGAAATACATGTTCGACTTCATAGTTTT 
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Fig. 16 (con't) 

Maelll Alul 

Mnll | CviJI 

Eco57I EcoRV| j Maelll MslI 

I I I I II 

TGGAGAAGTGATTGCTAATGCCCTCCAAGATATCGGTTACAATCTATATGTAACCACAGC 

102 i + + + + + + 1080 

ACCTCTTCACTAACGATTACGGGAGGTTCTATAGCCAATGTTAGATATACATTGGTGTCG 

BciVI 
TspRI 

MboII Mnll | 

BstXI Msel | BsrI | | 

I M III 

TATGGACGAAGATTTCATTAACACTCTCAATAGTATCCAGTGGTTAGAGGTCAATAACTC 

1081 + + + + + + 1140 

ATACCTGCTTCTAAAGTAATTGTGAGAGTTATCATAGGTCACCAATCTCCAGTTATTGAG 



Hindi 
AccI | 

BsaJI TaqI | 

Hpyl88IX Styl Sail | j 

I I Ml 

CATAGTTATTATCGGAAACCAAGGGAATGTCGACAGAGTTATTGGCCTCTTAAACGGTTT 

1141 + + + + + + 1200 

GTATCAATAATAGCCTTTGGTTCCCTTACAGCTGTCTCAATAACCGGAGAATTTGCCAAA 



CviJI 

Hael Mnll 

Haelll Msel Taal 

I I I 



Ddel 

Hpyl78III Alulj 
Tsp509I Bf al | CviJI | 

Mnll TaqI Msel | Xbal | j Cj ePI || 

I I II III I M 

AGATTTACCTCCTAAACAGGTTTACATCGAAGTTTTAATTCTAGATACCAGCTTAGAGAA 

1201 + + + + + + 1260 

TCTAAATGGAGGATTTGTCCAAATGTAGCTTCAAAATTAAGATCTATGGTCGAATCTCTT 



ScrFI 
BsaJI | 
EcoRII | j 
III 



BsmFI 
CjePI 



CviRI | 
I I 



Bfal 
Avrll | 
BsaJI j 
Styl | 
CviJI | | 
BsrDI | j j 
I II 



Alul 

HphI CviJI 
I I 

ATCCTGGGACTTTGGAGTGCAATGGGTAGCCCTAGGTGATGAACAAAGTAAAGTAGCTTA 

1261 + + + + + + 1320 

TAGGACCCTGAAACCTCACGTTACCCATCGGGATCCACTACTTGTTTCATTTCATCGAAT 
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BspGI 
CjePI | 
Hpyl78III I 
I I 



CviJI CjePI 
BsrI | BsmFI | 

II II 

TGCTTCTGGACTATTGAATAATACTGGCATAGCCACACCTACAAAAGCAACTGTCCCTCC 

1321 + + + + + + 1380 

ACGAAGACCTGATAACTTATTATGACCGTATCGGTGTGGATGTTTTCGTTGACAGGGAGG 



Mspl 
BsaXl Neil 
Taal | ScrFI 

I I I 



Mnll 
Sthl32I 
Cac8I | 
II 



Dpnl 
Sau3AI 
MboII | 
TaqI j 
Drdll 
Alwl | 



ScrFI 
BslI | 
Pf 1MI | 
EcoRII | j 
I I I 



Muni 
Tsp509I 
ScrFI 
BslI I 



EcoRII I 
ECONI | j 
Mnll j j 
Earl | j | 

II 



Hpyl8 8IX 
Ddel | 
I I 



CGGCACGCCAAATCCTGGTTCGATCCCTCTTCCTACGCCAGGACAATTGACAGGGTTCTC 



GCCGTGCGGTTTAGGACCAAGCTAGGGAGAAGGATGCGGTCCTGTTAACTGTCCCAAGAG 



Hpyl8 8IX 
Hinfl | 

BseMII Earl Tfil j 

MboII | TaqI I |BsmI Bfal | | Ddel 

I I I I I II I I 

AGATATGCTGAACTCTTCGTCAGCATTCGGTCTAGGAATCATCGGAAATGTCCTAAGTCA 

1441 + + + + + + 1500 

TCTATACGACTTGAGAAGCAGTCGTAAGCCAGATCCTTAGTAGCCTTTACAGGATTCAGT 



Hpyl78III 
Dpnl | 

BslI Sau3AI | | 

Xmnl Mnll | CviJI Msel Ddel | j |BccI 

I I I II I I I I I 

TAAAGGGAAGTCTTTCCTTACTTTGGGAGGCTTATTAAGTGCCTTAGATCAAGATGGAGA 

1501 + + + + + + 1560 

ATTTCCCTTCAGAAAGGAATGAAACCCTCCGAATAATTCACGGAATCTAGTTCTACCTCT 
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Fig. 16 (con't) 



Hpyl78III 
BpulOI | 

Ddel j Alul 
Taal Hinfl Hinfl Cvi JI | | CviJI 

BsaXI j Tfil Tfil BciVljj | Hindi I I | 

| Bfal j Nlallljj j BseMII | | BsmFI 

III III I M I I 

TACTGTCATTGTCTTGAATCCTAGAATCATGGCTCAGGATACGCAACAAGCTTCGTTTTT 

1561 + + + + + + 1620 

ATGACAGTAACAGAACTTAGGATCTTAGTACCGAGTCCTATGCGTTGTTCGAAGCAAAAA 



Hin4I | Hpyl78III 

I I 



NlalV 

Avail | Maelll 

Sau95I j Taal 

Taal j AlwNI | 

II M 
TGTAGGGCAAACGGTCCCTTACCAAACTATCAAATACTATATCCAAGAAACAGGAACTGT 

1621 + + + + + + 1680 

ACATCCCGTTTGCCAGGGAATGGTTTGATAGTTTATGATATAGGTTCTTTGTCCTTGACA 



Mnll 
BsaXI | 

Clal Taal | | 

TaqI MboII Maelll Sfcl | j j 

I I I I I I I 

AACGCAAAATATCGATTATGAAGATATTGGAGTGAACCTTGTCGTTACCTCTACAGTTGC 

1681 + + + + + + 1740 

TTGCGTTTTATAGCTAATACTTCTATAACCTCACTTGGAACAGCAATGGAGATGTCAACG 



Hgal 
Tsp509I j 
Hpyl88IX | j 

Ddel | j j Thai 
Dpnl III! Acil | 

Maelll TaqI Sau3AI | j j | j BseMII j 

I I I I I I II II 

TCCCAACAATGTAGTTACACTACAAATCGAACAGACGATCTCAGAATTACATTCCGCGTC 

1741 + + + + + + 1800 

AGGGTTGTTACATCAATGTGATGTTTAGCTTGTCTGCTAGAGTCTTAATGTAAGGCGCAG 



CviJI Hpyl78III 
Dpnl Maelll FnulHI | Apol | 

Sau3AI | Tsp45I CviRI | j Tsp509I j 

Hpyl78IIl| j Alwl | Tsel j j Bbvl | j 

II I I I II I III 

TGGATCACTAACACCTGTCACAGATAAAACTTATGCAGCCACACGCTTACAAATTCCCGA 

1801 + + + + + + 

ACCTAGTGATTGTGGACAGTGTCTATTTTGAATACGTCGGTGTGCGAATGTTTAAGGGCT 
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Fig. 16 (con't) 



Sthl32I 

Taall Ddel Hpyl88IX Hpyl78III 

II I I I 

CGGTTGTTTCTTAGTTATGAGTGGGCATATCAGAGATAAAACTACAAAAGTGGTTTCAGG 

1861 + + + + + + 

GCCAACAAAGAATCAATACTCACCCGTATAGTCTCTATTTTGATGTTTTCACCAAAGTCC 



Bcefl Taql 
Tsp509I | Bed | 

Msel| j Rsal | j 

Vsplj | CviJI | | | 

III I I I I 

AGTGCCTTTGCTAAACTCCATACCATTAATTCGTGGTTTATTTAGCCGTACCATCGACCA 

1921 + + + + + + 1980 

TCACGGAAACGATTTGAGGTATGGTAATTAAGCACCAAATAAATCGGCATGGTAGCTGGT 



Bsu36I 

Nlalll Ddel Hpiil 

Hpyl78III | CviJI | Alul | 

Real | | Msel | j CviJI j 

III III II 

AAGGCAAAAACGCAATATCATGATGTTTATTAAGCCTAAGGTGATTAGTAGCTTTGAAGA 

1981 + + + + + + 2040 



Maelll 
MboII | 
BssSI | j 
I I 



Muni 

ciVI Tsp509I 

I I 
AGGCACTCGTGTTACCAATAAGGAAGGATACAGATACAATTGGGAAGCTGATGAAGGATC 

2041 + + + + + + 2100 

TCCGTGAGCACAATGGTTATTCCTTCCTATGTCTATGTTAACCCTTCGACTACTTCCTAG 



Dpnl 
NlalV 
BamHI 
BStYI 
Sau3AI 
HaelV 
Hin4I 
Alwl | 
Alul | j 
CviJI j j 
I I I 
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NlalV 

CviJI | Avail 
Haelllj ECOO109I 
Sau96I | | Psp5II 
BstXI | j j Sau96I 
MslI | III Sse8647I 
Alwl | j ill Hpyl78III BsmI | 

CviRI | | Ml Mnll | BsaJI | j CviJI 

Nlalll Ml NlaIIl| | Stylj j Mnll | 

I | | Ml III III II 

CATGCAAGTGGCCCCTCGCCATGCTCCTGAATGCCAAGGACCTCCTTCTTTACAGGCTGA 

2101 + --- + + + + + 2160 



GTACGTTCACCGGGGAGCGGTACGAGGACTTACGGTTCCTGGAGGAAGAAATGTCCGACT 



Ddel 

Maelll Dral Alul | BseMII 

Tsp45I Msel I CviJI j Mnll | Beef I 

I I! II M I 

AAGTGACTTTAAAATAATAGAAATAGAAGCTCAGTAGTGGTATATAAAAGAGGAAGATGA 

2161 + + + + + + 2220 

TTCACTGAAATTTTATTATCTTTATCTTCGAGTCATCACCATATATTTTCTCCTTCTACT 



BsaJI 
BstDSI 
Ecil | 
Acil| | 
MboII | j j 
I II I 



Hinf I 
Hpyl8 8IX 
Alul 
CviJI 
Plel 
XmnI | 
I I 



BsmI 

CviRI CviJI Bcefl 

i I I 

TATTCTCCGCCGTGGAATAGCTTCTGACTCTGTTGCATTCAGGGGGAAAGCCAAGAAGAT 

2221 + + + + + + 2280 

ATAAGAGGCGGCACCTTATCGAAGACTGAGACAACGTAAGTCCCCCTTTCGGTTCTTCTA 



Plel 
BsiEI | 
CviJI | 
Haelll | 
Eael 
EagI 
Gdill 
MboII | 
Hinf I | | 
HaelV | j | 
Hin4I j | | 
I I II 
GTAGAGTCGGCCGTATAACT 

2281 + + 2300 

CATCTCAGCCGGCATATTGA 
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Figure 17: CPN 100557 



tagcttgaaa tagcttcctc caattgtgat ttctgaagaa gtataggggg aaatgtcgaa SO 



gagatagtct tgttttaaag gaggagggga aaacggttta atg age aga aaa gac 

Met Ser Arg Lys Asp 
Arg Lys Asp 
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cct aga gag cgt act ate ggt tta tec gtg get eta gtt ate ggg ttt 

Pro Arg Glu Arg lie lie Gly Leu Ser Val Ala Leu Val lie Gly Phe 

Pro Arg Glu Arg lie lie Gly Leu Ser Val Ala Leu Val He Gly Phe 

185 190 195 

ttc ttc gaa tgg tta ate acg gtt cct gga gta tgg aaa ttt eta tta 

Phe Phe Glu Trp Leu He Thr Val Pro Gly Val Trp Lys Phe Leu Leu 

Phe Phe Glu Trp Leu He Thr Val Pro Gly Val Trp Lys Phe Leu Leu 

2O0 205 210 

gaa gcg aag age cca cct caa gaa cac gat agt gtt cga get tta tta 

Glu Ala Lys Ser Pro Pro Gin Glu His Asp Ser Val Arg Ala Leu Leu 

Glu Ala Lys Ser Pro Pro Gin Glu His Asp Ser Val Arg Ala Leu Leu 
215 220 225 

get ccc tta tct ttg ggt att tta act tea age ate ttc cag ctg aac 

Ala Pro Leu Ser Leu Gly lie Leu Thr Ser Ser He Phe Gin Leu Asn 

Ala Pro Leu Ser Leu Gly He Leu Thr Ser Ser He Phe Gin Leu Asn 
230 235 240 245 

ctt ctt tct gat ate tgc ttg get cgc tat gta cat gaa ata ggc cct 
Leu Leu Ser Asp He Cys Leu Ala Arg Tyr Val His Glu He Gly Pro 
Leu Leu Ser Asp He Cys Leu Ala Arg Tyr Val His Glu He Gly Pro 
250 255 260 

eta tat ctt atg tac tec tta aag att tat cag etc ccc ata cat etc 
Leu Tyr Leu Met Tyr Ser Leu Lys He Tyr Gin Leu Pro He His Leu 
Leu Tyr Leu Met Tyr Ser Leu Lys He Tyr Gin Leu Pro He His Leu 
265 270 275 

ttt ggc ttt ggt gtg ttt ace gtt etc etc cca gca att tct cgt tgt 

Phe Gly Phe Gly Val Phe Thr Val Leu Leu Pro Ala lie Ser Arg Cys 

Phe Gly Phe Gly Val Phe Thr Val Leu Leu Pro Ala He Ser Arg Cys 

280 285 290 

gta cag cga gaa gat cat gag agg gga ttg aaa ctt atg aag ttc gtt 

Val Gin Arg Glu Asp His Glu Arg Gly Leu Lys Leu Met Lys Phe Val 

Val Gin Arg Glu Asp His Glu Arg Gly Leu Lys Leu Met Lys Phe Val 

295 300 305 

etc ace eta acc atg tec gta atg ate att atg aca gca ggg eta ttg 

Leu Thr Leu Thr Met Ser Val Met He lie Met Thr Ala Gly Leu Leu 

Leu Thr Leu Thr Met Ser Val Met He lie Met Thr Ala Gly Leu Leu 

310 315 320 325 

etc tta get tta cct gga gtc cgt gtc ctt tat gaa cac gga ctt ttc 

Leu Leu Ala Leu Pro Gly Val Arg Val Leu Tyr Glu His Gly Leu Phe 

Leu Leu Ala Leu Pro Gly Val Arg Val Leu Tyr Glu His Gly Leu Phe 

330 335 340 

cct cag agt get gtc tac get att gtt cgt gta ttg cga ggt tat ggt 
Pro Gin Ser Ala Val Tyr Ala He Val Arg Val Leu Arg Gly Tyr Gly 
Pro Gin Ser Ala Val Tyr Ala He Val Arg Val Leu Arg Gly Tyr Gly 
345 350 355 

gec agt att ate cct atg gee ttg get cct tta gtc tct gtt ctt ttt 
Ala Ser He He Pro Met Ala Leu Ala Pro Leu Val Ser Val Leu Phe 
Ala Ser He He Pro Met Ala Leu Ala Pro Leu Val Ser Val Leu Phe 
360 365 370 
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Pro Leu Phe lie Gly lie Gly Thr 
Pro Leu Phe lie Gly lie Gly Thr 
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get ttg gec aat act gtt tta age 

Ala Leu Ala Asn lie Val Leu Ser 

Ala Leu Ala Asn lie Val Leu Ser 
390 395 



ttg gtt eta ggt cgt tgg gtt tta 

Leu Val Leu Gly Arg Trp Val Leu 

Leu Val Leu Gly Arg Trp Val Leu 

400 405 



aaa gac gtc teg ggc att tec tat get aca tec ata act get tgg gtg 

Lys Asp Val Ser Gly lie Ser Tyr Ala Thr Ser lie Thr Ala Trp Val 

Lys Asp Val Ser Gly lie Ser Tyr Ala Thr Ser lie Thr Ala Trp Val 
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taatcatgtt tgtttcttgt agctcagtcg ctttctttta gctttaagtt ttgatagect 1801 
gcttggtctt ctgtttctac acttaatatt gatactaagg atactatgaa aaaacaggta 1861 
tatcaatggt tagcgagtgt ggttctttta gegctgaca 1900 
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Figure 18 (RY-43) 

Restriction enzyme analysis of CPN100557 

Taql 

Alul Alul Muni Hpyl8SIX Eco57I | 

CviJT CviJI Tsp509I Mnll | MboII Earl | j 

I I I I I I II I 

TAGCTTGAAATAGCTTCCTCCAATTGTGATTTCTGAAGAAGTATAGGGGGAAATGTCGAA 

! + + + + + + 

ATCGAACTTTATCGAAGGAGGTTAACACTAAAGACTTCTTCATATCCCCCTTTACAGCTT 



Mnll 
Dral 
Msel | 
Mnll | | 
MboII | j | 
I III 



Msel 
BseRI | 
Taal | j 
II ■ 



Mnll 
I 



GAGATAGTCTTGTTTTAAAGGAGGAGGGGAAAACGGTTTAATGAGCAGAAAAGACAATGA 
CTCTATCAGAACAAAATTTCCTCCTCCCCTTTTGCCAAATTACTCGTCTTTTCTGTTACT 



Alul Hpyl78III 
CviJI Mspl | 

BpulOI | BsaWlj | 

Ddel |Tsp509I Msel BspEI j | Sfcl 

I I I I I I I I 

GGTTTCCTTAGCTCGTTCAATTTTTAATATATTATCCGGAACTTTCTGTAGTCGTATTAC 

121 + + + + + + 180 

CCAAAGGAATCGAGCAAGTTAAAAATTATATAATAGGCCTTGAAAGACATCAGCATAATG 

Muni 
Tsp509I 
Dpnl | 

Bbvl | j Fnu4HI 

Sau3AI j j Alul | 

Alul | j j CviJI j 

CviJI j j j Tsel j 

Alwl Ml | Cjel || 

I I I I I III 



Tsp509I 

Hpyl78III |CviRI Acelll 

Taql | I Cjel | BsrDI 

Mill I 



AGGGATATTTCGAGAAATTGCAATGGCAACCTATTTTGGAGCTGATCCAATTGTAGCTGC 

h + + + + + 

TCCCTATAAAGCTCTTTAACGTTACCGTTGGATAAAACCTCGACTAGGTTAACATCGACG 



Taal 
Rsal | 
I I 



Msel Mnll 
Af III | Apol | 
Smll|Tsp509I j 
II I I 



Hpyl7BIII 
Banll Bfal | 
Bspl286I Fokl j 
CviJI |XbaI | j 
I I Ml 



AAAGACCAATCCAAAGGCATGACAAAAAAAGAATTCTTTTTAAAATCCTCCCGAGTAAGA 
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Fig. 18 (con't) 



Mnll 
Apol | 
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I I 



BsiEI 
Pvul 
Dpnl 
BsmAI | 
Sau3Al| j 
Hpyl78III | | j 
BsiHKAI | | | j 

Bspl286I TaqI j j j 

I III' 



Taul 
Acil | 
I I 



AGAACAAGCCTTCATCCCTCATTTTGAATTTCTCCGTGCTCAAAGTCTCGATCGTGCGGC 

301 + + + + + + 360 

TCTTGTTCGGAAGTAGGGAGTAAAACTTAAAGAGGCACGAGTTTCAGAGCTAGCACGCCG 



Hpyl8 8IX 
I 



Hpyl78III 
Bf al | 
Hgal | 
Xbal | j 
I I 



Fnu4HI 
Tsel | 
Mmel | | 
Msel ( | j 
I I 



Bbvl 

I 

GTTTTTTTTCCGACGCTTTTCTAGATTGATTAAAGGCAGCACTATTATATTCACTCTGCT 

3S1 + + + + + + 420 

CAAAAAAAAGGCTGCGAAAAGATCTAACTAATTTCCGTCGTGATAATATAAGTGAGACGA 



MboII 
I 



Mnll 
Acll | 
Earl j 
Maell j 
I I 



BsmFI 
Hinf I 
MboII Tfil 
I I 



TATTGAAGCAGTATTGTGGGTATTCTTCAATAACGTTGAAGAGGGGACTTACGATATGAT 

421 + + + + + + 480 

ATAACTTCGTCATAACACCCATAAGAAGTTATTGCAACTTCTCCCCTGAATGCTATACTA 



Rsal 
BsrGI | 
Msel TatI j 
I I I 

TCTCCTTACTATGATACTCTTGCCCTGTGGCATTTTCTTAATGATGTACAATGTAAACGG 

481 + + + + + + 540 

AGAGGAATGATACTATGAGAACGGGACACCGTAAAAGAATTACTACATGTTACATTTGCC 



Haell 
Hhal | 
I I 



Sthl32I 
TspRI | 
Taal | j 
Beef I | I j 
I I I I 



Sthl32I 
BscGI | 
Alul | | 
CviJI j | 
I I I 



CGCTTTGCTTCACTGTGGAAATAAGTTTTTCGGGGTGGGATTAGCTCCCGTAGTTGTAAA 

541 + + + + + + 600 

GCGAAACGAAGTGACACCTTTATTCAAAAAGCCCCACCCTAATCGAGGGCATCAACATTT 
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CviJI 
Fnu4HI | 
Taul j 
Acil| | 
II I 



Dpnl 
BstYI 
Sau3AI 
Hpyl88IX| 
Alwl | | 
I II 



TATCATTTGGATTTTCTTTGTTATAGCGGCTCGTCATTCAGATCCTAGAGAGCGTATTAT 

601 + + + + + + 660 

ATAGTAAACCTAAAAGAAACAATATCGCCGAGCAGTAAGTCTAGGATCTCTCGCATAATA 



Bfal 
Sthl32I 
CviJI | 
BsaJI | | 

BstDSI j j 

I I 



MboII 

I 



Cjel 
NspV | 
TaqI |MseI 
I I I 



ScrFI 
EcoRII | 

NlaIV| j 
Taal | j j 
I II I 



CGGTTTATCCGTGGCTCTAGTTATCGGGTTTTTCTTCGAATGGTTAATCACGGTTCCTGG 
GCCAAATAGGCACCGAGATCAATAGCCCAAAAAGAAGCTTACCAATTAGTGCCAAGGACC 



Hpyl7 8III 
MboII 

Banll 
Bspl286I 
Hin4I 
CviJI |SmlI| 
I I 

AGTATGGAAATTTCTATTAGAAGCGAAGAGCCCACCTCAAGAACACGATAGTGTTCGAGC 

721 + + + + + + 780 

TCATACCTTTAAAGATAATCTTCGCTTCTCGGGTGGAGTTCTTGTGCTATCACAAGCTCG 



Bce83I 
Earl | 
Sapl j 

Apol Bpml | j 

Tsp509I Cjel | j j 
I I I II 



BplI Alul 
Ms II j CviJI 
Mnll | I TaqI | 
I I I I I 



Tthlllll 
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CviJI 
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CviJI MboII PvuII 

Mwol | Msel | Sf aNI | 

II II II 

TTTATTAGCTCCCTTATCTTTGGGTATTTTAACTTCAAGCATCTTCCAGCTGAACCTTCT 

781 + + + + + + 840 

AAATAATCGAGGGAATAGAAACCCATAAAATTGAAGTTCGTAGAAGGTCGACTTGGAAGA 



CviJI 
Haelll 
EcoC-1091 j 
Nlalll | [ 

Cac8I Rsal | j | Rsal 

Mwol | BsrGI | j jj TatI | 

CviJI | | TatI j |Sau96l| Mnll | j 

III II I II ill 

TTCTGATATCTGCTTGGCTCGCTATGTACATGAAATAGGCCCTCTATATCTTATGTACTC 

841 + + + + + + 900 

AAGACTATAGACGAACCGAGCGATACATGTACTTTATCCGGGAGATATAGAATACATGAG 



EcoRV 
Hpyl88IX | 
Tthlllll | j 
I I 
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Fig. 18 (con't) 
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CviJI 
I 



BseRI Taal 
I I 



CTTAAAGATTTATCAGCTCCCCATACATCTCTTTGGCTTTGGTGTGTTTACCGTTCTCCT 
GAATTTCTAAATAGTCGAGGGGTATGTAGAGAAACCGAAACCACACAAATGGCAAGAGGA 



Mnll 
Tsp509I | 
I I 



Rsal 
BsrGI | 
Tat I | 
I I 



MboII 
Nlalll 
Hpyl78III 
Real 
Dpnl | 
Mnll j 
Sau3AI | j 
I I I 



CCCAGCAATTTCTCGTTGTGTACAGCGAGAAGATCATGAGAGGGGATTGAAACTTATGAA 



GGGTCGTTAAAGAGCAACACATGTCGCTCTTCTAGTACTCTCCCCTAACTTTGAATACTT 



Dpnl 
Bell | 

HphI Nlalll Sau3AI j CviJI Ddel 

I 111 II 

GTTCGTTCTCACCCTAACCATGTCCGTAATGATCATTATGACAGCAGGGCTATTGCTCTT 

021 + + + + + + : 

CAAGCAAGAGTGGGATTGGTACAGGCATTACTAGTAATACTGTCGTCCCGATAACGAGAA 

BsaXI 
Hin4I 
Hinfl | 

ScrFI | | BseMII 
EcoRII | | | Mwol 
Alul III I Hpyl88IX Accl j 

CviJI iPlel Bpml Ddel | Mnll | | 

I I I I I I I M III 

AGCTTTACCTGGAGTCCGTGTCCTTTATGAACACGGACTTTTCCCTCAGAGTGCTGTCTA 

081 + + + + + + 

TCGAAATGGACCTCAGGCACAGGAAATACTTGTGCCTGAAAAGGGAGTCTCACGACAGAT 



BsrI 
NlalV | 
BanI j | 

I I 



BsaJI 
Styl 
CviJI | 
Haelj NlalV 
Haelll j CviJI | 
II II 



CGCTATTGTTCGTGTATTGCGAGGTTATGGTGCCAGTATTATCCCTATGGCCTTGGCTCC 



GCGATAACAAGCACATAACGCTCCAATACCACGGTCATAATAGGGATACCGGAACCGAGG 
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Fig. 18 (con't) 

Fnu4HI 
Taul 
Acil | 

MspAllj BsrBI Hinfl 

BsmAI CviRI | | Acil | Tfil 

I III II I 

TTTAGTCTCTGTTCTTTTTTATGCACAGCGGCAGTATGCTGTTCCGCTCTTTATAGGAAT 

1201 + + + + + + : 

AAATCAGAGACAAGAAAAAATACGTGTCGCCGTCATACGACAAGGCGAGAAATATCCTTA 

CviJI Alul BsaHI 

Hael CviJI Maell 

Haelll Hindlll | Sthl32l | 

CviJI MscI Bcefl | | Bfal Dral | | 

Rsal lEael | Sspl | Msel | |DrdII | Msel | | | 

III! II III II II I I 
CGGTACGGCTTTGGCCAATATTGTTTTAAGCTTGGTTCTAGGTCGTTGGGTTTTAAAAGA 

1261 + + + + + + 

GCCATGCCGAAACCGGTTATAACAAAATTCGAACCAAGATCCAGCAACCCAAAATTTTCT 

Fokl 
BsmAI 
BsmBI 
Aval | 
Aatll | j 
I I I 



Taqll 

Tthlllll | CviRI MboII 

.... II i I 

CGTCTCGGGCATTTCCTATGCTACATCCATAACTGCTTGGGTGCAGTTATATTTCCTCTG 

1321 + + + + + + 

GCAGAGCCCGTAAAGGATACGATGTAGGTATTGACGAACCCACGTCAATATAAAGGAGAC 

Plel Fokl 

Mnll NspV| Rsal Maelll | SfaNI 

Bsgl | Taqlj Hinfl TatI |DdeI | j Mspl | 

I I II I Mill II 

GTATTATTCTTCGAAAAGACTCCCTATGTACTCTAAGTTACTTTGGGAGAGCATCCGGCG 

1381 + + + + + + : 

CATAATAAGAAGCTTTTCTGAGGGATACATGAGATTCAATGAAACCCTCTCGTAGGCCGC 

Tthlllll 
Drdll | 

NlaIV| | Msel 
Tthlllll | j I CviJI | 

Xcml | Ml CacSI Bfal | | 

I I III I IM 

TTCCATAAAAGTTATGGGAACCACTATGCTTGCTTGTATGATTACTCTAGGCTTAAATAT 

1441 + + + + + + 

AAGGTATTTTCAATACCCTTGGTGATACGAACGAACATACTAATGAGATCCGAATTTATA 



1380 
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Tsp509I 
Ndel | 
I 



Msel 
i 



Bce83I 
CjePI 
CviJI 
Haelll 
Sau96I | 
BstXI | | 
Mnll | j | 
I II 



CCTTACGCAAACTACATATGTAATTTTCTTAAACCCCCTCACACCACTTGCTTGGCCCTT 

GGAATGCGTTTGATGTATACATTAAAAGAATTTGGGGGAGTGTGGTGAACGAACCGGGAA 

Ddel 
Hpyl8 8IX 
Tthlllll 

CjePI 
Bbvl 



Muni 
Tsp5 09I 
Smll | Acelll | 

CviJI j | Beef I | j 

Mnll | j |BseMII j j 

III I II' 



CviRI 
Fnu4HI 
Alul | 
CviJI j 
Tsel j 
I I 



CviJI 
Tthlllll | 
I 



ATCCTCCATAACGGCTCAAGCAATTGCTTTTTTATCTGAGAGCTGCATTTTCTTGGCTTT 

1561 + + --- + + + + 1620 

TAGGAGGTATTGCCGAGTTCGTTAACGAAAAAATAGACTCTCGACGTAAAAGAACCGAAA 

Dpnl MboII 
Bglll (Tsp509l| 
BstYI | Msel I I 
CviRI TaqI Sau3AI j Vspl | j CviJI 

I I II IN I 

TTTGTTTGGTTTTGCAAAACTGCTTCGAGTAGAAGATCTTATTAATTTGGCTTCTTTTGA 

1621 + + + + + + 1680 

AAACAAACCAAAACGTTTTGACGAAGCTCATCTTCTAGAATAATTAAACCGAAGAAAACT 



BsrI 
Sthl32I 



SimI 
BscGI | 
I I 



BsaAI 
Pmll 
Maell 
Afllll | 
SfaNI | j 
CviRI |BsbI | 

I III 



CviRI 
I 



Tsp5 0 9I 
Tthlllll 



ATACTGGCGTGGGCAACGGGGTCTTTTGCAAAGACAACACGTGATGCAAGACACTCAAAA 
TATGACCGCACCCGTTGCCCCAGAAAACGTTTCTGTTGTGCACTACGTTCTGTGAGTTTT 



Msel 
Vspl 



Ddel 
Alul | 
CviJI j 
I I 



Tthlllll 
Msel | 
Alul | j 
CviJI | j 
BseMII | j | 
I I 



Bbsl 
Cac8I 
MboII 
CviJI | 
I I 



TTAATCATGTTTGTTTCTTGTAGCTCAGTCGCTTTCTTTTAGCTTTAAGTTTTGATAGCC 



AATTAGTACAAACAAAGAACATCGAGTCAGCGAAAGAAAATCGAAATTCAAAACTATCGG 
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Sspl Ddel 
Msel | BciVI | 
II II 

TGCTTGGTCTTCTGTTTCTACACTTAATATTGATACTAAGGATACTATGAAAAAACAGGT 

180 i + + + + + + I860 

ACGAACCAGAAGACAAAGATGTGAATTATAACTATGATTCCTATGATACTTTTTTGTCCA 

Haell 
Hhal | 

Drdll Eco47III | | 

I III 
ATATCAATGGTTAGCGAGTGTGGTTCTTTTAGCGCTGACA 

1861 + + + + 1900 

T ATAG TTAC C AAT CG CT C AC ACC AAG AAAAT CG CG AC TGT 



83/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



Title: CHLAMYDIA AMM^I^H^'ii HT- «■■ O O E* 

CORRESPONDING DNA FRAGMENTS 09/8H044(S> 
AND USES THEREOF U O^U^SfO 

Inventor(s): Andrew D. MURDIN et al PCT/CA99/00992 
DOCKET NO.: 032931/0251 



Figure 19: CPN 100622' 

tctcaagagt aaccttatcc ttagattatt cagctcaagt ctcctcgtca actgtaggcc 60 

aataccttaa agctgagagt cattgcacat tttaaccaca atg aaa aca tea agg lis 

Met Lys Thr Ser Arg 
1 5 

aat aaa cag tgc aaa ata aca gat ccc tta agt aaa tct tec ttc ttt 163 
Asn Lys Gin Cys Lys lie Thr Asp Pro Leu Ser Lys Ser Ser Phe Phe 
10 . 15 20 

gtt gga gec tta att tta ggt aaa act aca ata etc ctt aat gcg act 211 
Val Gly Ala Leu He Leu Gly Lys Thr Thr He Leu Leu Asn Ala Thr 
25 30 35 

ccg ttg tct gac tat ttt gat aat caa gca aat caa etc aca aca etc 259 
Pro Leu Ser Asp Tyr Phe Asp Asn Gin Ala Asn Gin Leu Thr Thr Leu 
40 45 50 

ttc cct eta att gat act ctt act aac atg act ccc tac tct cat aga 307 
Phe Pro Leu He Asp Thr Leu Thr Asn Met Thr Pro Tyr Ser His Arg 
55 60 65 

gca aca ctt ttt gga gtt agg gat gac act aac caa gac att gtc etc 355 
Ala Thr Leu Phe Gly Val Arg Aso Asp Thr Asn Gin Asd He Val Leu 
70 75 80 " 85 

gat cac cag aat tec ata gaa age tag ttc gaa aac ttc tct caa gac 403 
Asp His Gin Asn Ser He Glu Ser Trp Phe Glu Asn Phe Ser Gin Asp 
90 95 100 

ggc ggt get etc tct tgc aaa tea ctt gee ata acg aat aca aaa aac 451 
Gly Gly Ala Leu Ser Cys Lys Ser Leu Ala He Thr Asn Thr Lys Asn 
105 no us 

caa att ctt ttc eta aat age ttt get att aaa aga get ggt gcg atg 499 
Gin He Leu Phe Leu Asn Ser Phe Ala He Lys Arg Ala Gly Ala Met 
120 125 130 

tat gtt gat ggt aat ttc gat ctt tct gag aat cat ggt tec ate att 547 
Tyr Val Asp Gly Asn Phe Asp Leu Ser Glu Asn His Gly Ser He He 
135 140 145 

ttc tct ggg aat tta age ttt cct aat gca agt aat ttc get gat act 5 95 
Phe Ser Gly Asn Leu Ser Phe Pro Asn Ala Ser Asn Phe Ala Asp Thr 
150 155 160 165 



tgt aca ggg gga get 
Cys Thr Gly Gly Ala 
Thr Gly Gly Ala 
170 

aat caa gga ace gca 
Asn Gin Gly Thr Ala 
Asn Gin Gly Thr Ala 
185 



gtc tta tgt teg aaa aat 

Val Leu Cys Ser Lys Asn 

Val Leu Cvs Ser Lvs Asn 
175 

tac ttc att aac aac aag 

Tyr Phe He Asn Asn Lvs 

Tyr Phe He Asn Asn Lys 
190 



gtt aca ate tea aaa 643 
Val Thr He Ser Lys 
Val Thr He Ser Lvs 
180 

gca aaa tct tea gga 691 
Ala Lys Ser Ser Giy 
Ala Lys Ser Ser Gly 
195 
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gga gca ate caa get gca ate ata aac att aag gac aac act ggc cct 739 
Gly Ala He Gin Ala Ala He He Asn lie Lys Asd Asn Thr Gly Pro 
Gly Ala He Gin Ala Ala He Ha A^n He Lys Asp Asn Thr Gly Pro 

200 205 210 

tgc ctg ttt ttt aac aat get gca ggc gga aca gcg ggg ggc gcg ttg 787 
Cys Leu Phe Phe Asn Asn Ala Ala Gly Gly Thr Ala Gly Gly Ala Leu 
Cys Leu Phe Phe Asn Asn Ala Ala Gly Gly Thr Ala Gly Gly Ala Leu 
215 220 225 

ttc get aat get tgt aga att gag aat aat tct cag cct ate tat ttt 835 

Phe Ala Asn Ala Cys Arg He Glu Asn Asn Ser Gin Pro He Tyr Phe 

Phe Ala Asn Ala Cys Axg He Glu Asn Asn Ser Gin Pro He Tyr Phe 

230 235 240 245 

ttg aat aac caa tea ggt ctg ggt ggt gca ata aga gta cat caa gag 883 

Leu Asn Asn Gin Ser Gly Leu Gly Gly Ala He Arg Val His Gin Glu 

Leu Asn Asn Gin Ser Gly Leu Gly Gly Ala He Arg Val His Gin Glu 
250 255 260 

tgc att ctt aca aag aat ace ggt tct gtg ate ttc aac aat aac ttt 931 

Cys He Leu Thr Lys Asn Thr Gly Ser Val He Phe Asn Asn Asn Phe 

Cys He Leu Thr Lys Asn Thr Gly Ser Val He Phe Asn Asn Asn Phe 
265 270 275 

gec atg gaa gcg gac ate tct get aac cat tec tct gga ggg get ate 97 9 

Ala Met Glu Ala Asp He Ser Ala Asn His Ser Ser Gly Gly Ala lie 

Ala Met Glu Ala Asp He Ser Ala Asn His Ser Ser Gly Gly Ala He 

280 285 290 

tat tgc att agt tgc tct ata aaa gac aac cca gga att gca gee ttc 1027 

Tyr Cys He Ser Cys Ser He Lys Asp Asn Pro Gly lie Ala Ala Phe 

Tyr Cys He Ser Cys Ser He Lys Aso Asn Pro Glv He Ala Ala Phe 
295 300 305 

gat aat aat act gca gca cga gat gga ggt get ate tgt aca caa tct 1075 

Asp Asn Asn Thr Ala Ala Arg Asp Gly Gly Ala lie Cys Thr Gin Ser 

Asp Asn Asn Thr Ala Ala Arg Asd Gly Gly Ala lie Cys Thr Gin Se^- 

310 315 320 325 

eta act ata caa gac agt ggt cec gtc tat ttc aca aac aat cag gga 1123 

Leu Thr He Gin Asp Ser Gly Pro Val Tyr Phe Thr Asn Asn Gin Glv 

Leu Thr He Gin Asp Ser Gly Pro Val Tyr Phe Thr Asn Asn Gin Gly 

330 335 340 

act tgg ggc ggc get ate atg etc cgt caa gat ggt gca tgc act tta 1171 

Thr Trp Gly Gly Ala lie Met Leu Arg Gin Asp Gly Ala Cys Thr Leu 

Thr Trp Gly Gly Ala lie Met Leu Arg Gin Asp Gly Ala Cys Thr Leu 
345 350 355 

ttt- get gat cag gga gat att att ttt tat aat aat aga cac ttc aaa 1219 

Phe Ala Asp Gin Gly Asp He lie Phe Tyr Asn Asn Arg His Phe Lys 

Phe Ala Asp Gin Gly Asp He He Phe Tyr Asn Asn Arg His Phe Lys 

360 365 370 

gat act ttc age aat cat gtt tct gta aac tgc acg cgt aat gtc tea 1267 

Asp Thr Phe Ser Asn His Val Ser Val Asn Cys Thr Arg Asn Val Ser 

Asp Thr Phe Ser Asn Hxs Val Ser Val Asn Cvs Thr Arg Asn Val Se*- 

375 380 ' 385 
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tta aca gtt gga gca agt caa ggt cat tct get acc ttc tat gat ccc 1315 

Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala Thr Phe Tyr Asp Pro 

Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala Thr Phe Tyr Aso Pro 
390 395 400 " 405 

ata eta caa aga tat act ata caa aac tct ate caa aaa ttt aat cct 1363 

lie Leu Gin Arg Tyr Thr lie Gin Asn Ser lie Gin Lys Phe Asn Pro 

lie Leu Gin Arg Tyr Thr lie Gin Asn Ser lie Gin Lys Phe Asn Pro 
410 415 420 



aat cca gaa cac etc gga act ate ttg ttc tec tea aca tat att ccg 1411 

Asn Pro Glu His Leu Gly Thr lie Leu Phe Ser Ser Thr Tyr lie Pro 

Asn Pro Glu His Leu Gly Thr lie Leu Phe Ser Ser Thr Tyr lie Pro 

425 430 435 



gat aca teg act tct 
Asp Thr Ser Thr Ser 
Aso Thr Ser Thr Ser 
440 



cgt gat gac ttc att tea 
Arg Asp Asp Phe lie Ser 
Arg Asp Asd Phe lie Ser 
4 45 



cat ttc aga aac cac 1459 

His Phe Arg Asn His 

His Phe Arg Asn His 
450 



att gga ctg tac aac ggc aca etc get ctt gaa gat cga gca gag tgg 1507 

lie Gly Leu Tyr Asn Gly Thr Leu Ala Leu Glu Asp Arg Ala Glu Trp 

lie Glv Leu Tyr Asn Gly Thr Leu Ala Leu Glu Asp Arg Ala Glu Trp 

455 460 465 



aaa gtc tat aaa ttt gat caa ttt ggt ggg act eta egg tta ggc agt 1555 

Lys Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr Leu Arg Leu Gly Ser 

Lvs Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr Leu Arg Leu Gly Ser 

470 " 475 480 485 



aga get gtg ttt tct aca aca gac gaa gaa caa agt age agt agt gtg 1603 

Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin Ser Ser Ser Ser Val 

Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin Ser Ser Ser Ser Val 

490 495 500 



ggt tct gta att aac ate aat aat ctt gca att aac ctt ccc tct ate 1651 

Gly Ser Val lie Asn lie Asn Asn Leu Ala lie Asn Leu Pro Ser lie 

Gly Ser Val He Asn lis Asn Asn Leu Ala lie Asn Leu Pro Ser He 

505 510 515 



tta ggc aac aga gtt get ccc aag eta tgg att cgc ccc aca ggt tea 1699 

Leu Gly Asn Arg Val Ala Pro Lys Leu Trp He Arg Pro Thr Gly Ser 

Leu Gly Asn Arg Val Ala Pro Lys Leu Trp He Arg Pro Thr Gly Ser 

520 525 530 



tea gca ccc tat age gaa gat aat aac cct ata ate aat etc tea gga 1747 

Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro He He Asn Leu Ser Gly 

Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro He He Asn Leu Ser Gly 
535 540 545 



cct ttg age eta 
Pro Leu Ser Leu 
Pro Leu Ser Leu 
550 

gac ctt gee caa 
Asp Leu Ala Gin 
Asp Leu Ala Gin 



ctg gat gac gag 
Leu Asp Asp Glu 
Leu Aso Asp Glu 
555 

cct ate gca gaa 
Pro He Ala Glu 
Pro He Ala Glu 
570 



aac eta gat ccc 
Asn Leu Asp Pro 
Asn Leu Asp Pro 
560 

gtt cct ctt ctg 
Val Pro Leu Leu 
Val Pro Leu Leu 
575 



Tyr Asp Thr Ala 
Tyr Aso Thr Ala 
565 

tat etc tta gac 1843 
Tyr Leu Leu Asp 
Tyr Leu Leu Aso 
580 
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He 
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Leu 
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Cys 




Leu 
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cct 


etc 


cac 


ttc 


act 


cct 


ttt 


2563 


Pro 


Leu 


His 


Phe 


Thr 


Pro 


Phe 




Pro 


Leu 


His 


Phe 


Thr 


Pro 


Phe 





Thr Glu Gly Lys Cys Ty: 

790 795 

tct eta tct eta caa tgg 

Ser Leu Ser Leu Gin Trp Arg Ser Arg 

Se- Leu Ser Leu Gin Trp Arg Ser Arg 

810 815 820 

ate caa gca att gee gtt cgt tct aat caa act gcg ttt caa gaa age 

He Gin Ala He Ala Val Arg Ser Asn Gin Thr Ala Phe Gin Glu Ser 

He Gin Ala He Ala Val Arg Ser Asn Gin Thr Ala Phe Gin Glu Ser 
825 830 835 

tat aac ctg 

Gly Asp Lys Ala Arg 



Gly Asp Lys Ala Arg 



aca gtc 
Thr val 
Thr Val 



aaa 




tct 


gtt 


cat 


aaa 


ccc 


tta 


Lys 


Phe 


Ser 


Val 


His 


Lys 


Pro 


Leu 




Phe 


Ser 


Val 
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Leu 
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aac 
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gag 
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get 
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cag 
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gtc 


etc 
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caa 


2755 
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He 


Glu 


Leu 


Ala 


Tyr 


Gin 


Pro 


Val 


Leu 


Tyr 


Gin 




Asn 


He 


Glu 


Leu 
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Tyr 


Gin 
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Val 


Leu 
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Ser 


Gly 
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Ser 


Tro 




He 
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Val 


Ser 


Leu 


Glu 


Ser 


Ser 


Gly 


Ser 


Ser 


Trp 





850 895 900 

etc eta tea gga acc ace ctt get cgc aat gec att get ttt aaa gga 

Leu Leu Ser Gly Thr Thr Leu Ala Arg Asn Ala He Ala Phe Lys Gly 

Leu Leu Ser Glv Thr Thr Leu Ala Arg Asn Ala He Ala Phe Lys Gly 

90S 910 915 

aaa ct 



caa 


att 


ttt 


ate 


ttc 


cct 


Gin 


He 


Phe 




Phe 


Pro 


Gin 


He 


Phe 


He 


Phe 


Pro 


920 
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Ala 


Gly 


His 


Tyr 


Leu 


His 


Ala 


Giv 




945 











950 

aatagagagt agtgaacact ctctaccatc atgaatctta tgggagaagc taagggaaat 3062 

ceacagatac gtttecccca taaaaat.aa gaaeccgata catcc'cact agagartcga 3122 

aagaactact taaatcctaa gcattcga 3150 
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2227 


tat 

Tyr 
710 


tct 
Ser 


ttg 
Leu 


gga 
Gly 


aca 
Thr 


ace 
Thr 
Thr 
715 


tea 
Ser 
Ser 


aac 
Asn 


act 
Thr 


get 
Ala 
Ala 


tct 

Ser 
720 


aat 
Asn 


cat 
His 


age 
Ser 


ttt 
Phe 


ggt 

Gly 
725 


2275 


gta 
Val 
Val 


aac 
Asn 
Asn 


ttc 
Phe 
Phe 


tec 
Ser 
Ser 


caa 
Gin 
Gin 
730 


ctt 
Leu 
Leu 


ttc 
Phe 
Phe 


agt 
Ser 
Ser 


aat 
Asn 
Asn 


etc 
Leu 
Leu 
735 


tac 
Tyr 
Tyr 


gag 
Glu 
Glu 


age 
Ser 
Ser 


cac 
His 
His 


tec 
Ser 
Ser 
740 


gac 
Asp 
Asp 


2323 


aat 
Asn 
Asn 


tec 
Ser 
Ser 


gtg 
Val 
Val 


get 
Ala 
Ala 
745 


teg 
Ser 
Ser 


cat 
His 
His 


acg 
Thr 
Thr 


aca 
Thr 
Thr 


act 
Thr 
Thr 
750 


gta 
Val 
Val 


gcg 
Ala 
Ala 


etc 
Leu 
Leu 


cag 
Gin 
Gin 


ate 
He 
lie 
755 


aat 
Asn 
Asn 


aat 
Asn 
Asn 


2371 


cct 
Pro 
Pro 


tgg 
Trp 
Trp 


ctg 
Leu 
Leu 
760 


caa 
Gin 
Gin 


gag 
Glu 
Glu 


aga 
Arg 
Arg 


ttc 
Phe 
Phe 


tct 
Ser 
Ser 
765 


aca 
Thr 
Thr 


tct 
Ser 
Ser 


gca 
Ala 
Ala 


tct 
Ser 
Ser 


eta 
Leu 
Leu 
770 


gec 
Ala 
Ala 


tac 
Tyr 
Tyr 


age 
Ser 
Ser 


2419 
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Figure 20 (RY-44) 

Restriction enzyme analysis of CPN 100622 



Maelll 
Hpyl78IIl | 
Smll | j 
I I I 



Ddel 
Bce83I I 



Smll 
Alul | 
CviJI j 
BseRI | | 
I I I 



Mnll 
Taal 
Sf cl 
Hindi | 
BsmAI | j 

Acelllj | | 

I 1 



TCTCAAGAGTAACCTTATCCTTAGATTATTCAGCTCAAGTCTCCTCGTCAACTGTAGGTC 



AGAGTTCTCATTGGAATAGGAATCTAATAAGTCGAGTTCAGAGGAGCAGTTGACATCCAG 



BsrDI 
Hinf I 
Ddel | 
Msel Alul | j | CviRI 
BseMII |CviJl| j j Plel 
II Mil I 



Msel 
I 



AATACCTTAAAGCTGAGAGTCATTGCACATTTTAACCACAATGAAAACATCAAGGAATAA 



TTATGGAATTTCGACTCTCAGTAACGTGTAAAATTGGTGTTACTTTTGTAGTTCCTTATT 



Mmel 
MboII 
Msel 



Af III 
Dpnl 
BstYI 
Sau3AI 
Alwl 
TspRI | 
CviRI | j 
Taal Ml | (Smll 

I 



I I I I 



Tsp509I 
Msel | 
CviJI | | 
Nlaiv| j | 
II 



ACAGTGCAAAATAACAGATCCCTTAAGTAAATCTTCCTTCTTTGTTGGAGCCTTAATTTT 



TGTCACGTTTTATTGTCTAGGGAATTCATTTAGAAGGAAGAAACAACCTCGGAATTAAAA 

Plel Hpyl88IX 
Msel | Hinf I DrdI | 
II I I I 
AGGTAAAACTACAATACTCCTTAATGCGACTCCGTTGTCTGACTATTTTGATAATCAAGC 
+ + + + + + 

TCCATTTTGATGTTATGAGGAATTACGCTGAGGCAACAGACTGATAAAACTATTAGTTCG 



Bsbl Hinfl 
Tthlllll | Tsp509I Nlalll 

MboII | j Earl | Mnll Plel | 

Ml III II 

AAATCAACTCACAACACTCTTCCCTCTAATTGATACTCTTACTAACATGACTCCCTACTC 

241 + + + + + + 300 

TTTAGTTGAGTGTTGTGAGAAGGGAGATTAACTATGAGAATGATTGTACTGAGGGATGAG 
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Fig. 20 (con't) 



Dpnl 
Sau3AI 
HphI | 
Tthllll | | 

Cjel Bsbl Cjel Fokl | j TaqI | 

| | I I I I II 

TCATAGAGCAACACTTTTTGGAGTTAGGGATGACACTAACCAAGACATTGTCCTCGATCA 

301 + + + + + + 360 

AGTATCTCGTTGTGAAAAACCTCAATCCCTACTGTGATTGGTTCTGTAACAGGAGCTAGT 



Apol 
EcoRI 
Tsp509I 
Mnll | 
I I 



NspV 
TaqI 
Drdll 
Bce83I | 
Alul j | 
CviJI | | 

! I 



BsiHKAI 

Hpyl78III Bspl286I CviRI 

Smll | Acil | Bcefl | 

II I I II 

CCAGAATTCCATAGAAAGCTGGTTCGAAAACTTCTCTCAAGACGGCGGTGCTCTCTCTTG 

361 + + + + + + 420 

GGTCTTAAGGTATCTTTCGACCAAGCTTTTGAAGAGAGTTCTGCCGCCACGAGAGAGAAC 



Acelll 

Apol Alul j 

Tsp509I CviJI J Msel 

I III 
CAAATCACTTGCCATAACGAATACAAAAAACCAAATTCTTTTCCTAAATAGCTTTGCTAT 

421 + + + + + + 480 

GTTTAGTGAACGGTATTGCTTATGTTTTTTGGTTTAAGAAAAGGATTTATCGAAACGATA 



Hinf I 
Ddel 
Hpyl8 8IX 
Dpnl 
Sau3AI 



Alul 
CviJI 
I 



BseMII | 
Tsp509I | j 

Bed | TaqI j 
I II 



NlalV 
Drdll | 
Tf il Nlalll | | 
I III 



TAAAAGAGCTGGTGCGATGTATGTTGATGGTAATTTCGATCTTTCTGAGAATCATGGTTC 



ATTTTCTCGACCACGCTACATACAACTACCATTAAAGCTAGAAAGACTCTTAGTACCAAG 



Alul 
CviJI 
Hindlll 
Msel | 
Apol | | 
Tsp509I | | 
I I I 



Rsal 
BsrGI | 

Tsp509I TatI | 

Bed Tsp509I | | | CviRI | Acelll | j 

I I I I I II IM 

CATCATTTTCTCTGGGAATTTAAGCTTTCCTAATGCAAGTAATTTCGCTGATACTTGTAC 

541 + + + + + + 600 

GTAGTAAAAGAGACCCTTAAATTCGAAAGGATTACGTTCATTAAAGCGACTATGAACATG 
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Alul NspV Acil 

CviJI TaqI Maelll NlalV | 

III M 
AGGGGGAGCTGTTTTATGTTCGAAAAATGTTACAATCTCAAAAAATCAAGGAACCGCATA 

601 + + + + + + 660 

TCCCCCTCGACAAAATACAAGCTTTTTACAATGTTAGAGTTTTTTAGTTCCTTGGCGTAT 

CviRI 
BseRI 
Fnu4HI 
Hin4I Alul | 

Bbvl | CviJI j 

Hpyl78III | j Tsel j 

Mnll | j j Mwol | j | Msel 

I III I II I I 

CTTCATTAACAACAAGGCAAAATCTTCAGGAGGAGCAATCCAAGCTGCAATCATAAACAT 

661 + + + + + + 720 

GAAGTAATTGTTGTTCCGTTTTAGAAGTCCTCCTCGTTAGGTTCGACGTTAGTATTTGTA 



Eco57I 
Msel | MboII 
I 



BsrI 
TspRI 
CviJI | 
Haelll | 
Sau96l| j 
Bsbl | | | 

III 



Bbvl Msel 

I ! 



Ecil 
Faul 
Acil 
Sthl32I 
Cac8I 
PstI 
CviRI 
Fnu4HI | 
Sfcl j 
Tsel | j 
II I 



Acil 
MspAlI 
I 



721 + + + + + + 780 

ATTCCTGTTGTGACCGGGAACGGACAAAAAATTATTACGACGTCCGCCTTGTCGCCCCCC 

Mwol 
Tthlllll | 

Hhal | j CviJI 

Thai [ j Tsp509I Tsp509I Ddel | BseMII 

III I III I 

CGCGTTGTTCGCTAATGCTTGTAGAATTGAGAATAATTCTCAGCCTATCTATTTTTTGAA 

781 + + + + + + 840 

GCGCAACAAGCGATTACGAACATCTTAACTCTTATTAAGAGTCGGATAGATAAAAAACTT 



Hpyl78III 
Rsal | BsmI 

Taqll CviRI Tatl | j CviRI 

I I I I I I 

TAACCAATCAGGTCTGGGTGGTGCAATAAGAGTACATCAAGAGTGCATTCTTACAAAGAA 

841 + + + + + + 900 

ATTGGTTAGTCCAGACCCACCACGTTATTCTCATGTAGTTCTCACGTAAGAATGTTTCTT 
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Acil 
Mwol 
Nlalll | 
BsaJI | I 
BstDSI j j 
Ncol j j 
Tsp509I Styl | J 

! II! 

TACCGGTTCTGTGATCTTCAACAATAATTTTGCCATGGAAGCGGACATCTCTGCTAACCA 

901 + + + + + + 960 

ATGGCCAAGACACTAGAAGTTGTTATTAAAACGGTACCTTCGCCTGTAGAGACGATTGGT 



Dpnl 
Sau3AI 
MboII | 
Mspl | j 
BsaWI | j j 
BsrFI j | | 
PinAI j j j 
I I 



CviJI 
Hin4I | 
BslI | j 
Hpyl78III | j j 
BstXI || || 
Mnll | j jMnll| 
III! II 



Bpml 
CviRI | 
I I 



Tsp509I 
ScrFI |Fnu4HI 
BsaJI | j CviRI | 
EcoRII j [ Tselj 
III II 



TTCCTCTGGAGGGGCTATCTATTGCATTAGTTGTTCTATAAAAGACAACCCAGGAATTGC 



AAGGAGACCTCCCCGATAGATAACGTAATCAACAAGATATTTTCTGTTGGGTCCTTAACG 



Mnll 
BssSI 
PstI | 
Fnu4HI | j 

CviRI !||| Rsal 
Tsel {ill Bbvl BsrGI | 

Sfcl III j j Bccl|Hin4I TatI j BsmFI 
I I i I I I II I II I 

AGCCTTCGATAATAATACTGCAGCACGAGATGGAGGTGCTATCTGTACACAATCTCTAAC 

1021 + + + + + + 1080 

TCGGAAGCTATTATTATGACGTCGTGCTCTACCTCCACGATAGACATGTGTTAGAGATTG 



TaqI 
CviJI | Bbvl 
I I I 



Sthl32I 
BscGI 
NlalV 



TspRI 
Avail | 
Sau96I j 
PshAI | j 
Taal| j | 
I I 



Haell 
Fnu4HI | 
Taul | 
Tthlllll AciljHhalj 
I II II 

TATACAAGACAGTGGTCCCGTCTATTTCACAAACAATCAGGGAACTTGGGGCGGCGCTAT 

1081 + + + + + + 1140 

ATATGTTCTGTCACCAGGGCAGATAAAGTGTTTGTTAGTCCCTTGAACCCCGCCGCGATA 
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CviRI 
Nlalll 
Nspl 
SphI 

Cac8I j Dpnl 
Hpyl78III CviRI | j Bell | 

Nlalll | Bed | j j Sau3AI | 

I I I Ml I I 

CATGCTCCGTCAAGATGGTGCATGCACTTTATTTGCTGATCAGGGAGATATTATTTTTTA 

1141 + + + + + + 1200 

GTACGAGGCAGTTCTACCACGTACGTGAAATAAACGACTAGTCCCTCTATAATAAAAAAT 

Mmel 
Thai 
Af 1III 
Cac8I 

Nlalll Mlul 
Bsgl | CviRI [ 

II M 
TAATAATAGACACTTCAAAGATACTTTCAGCAATCATGTTTCTGTAAACTGCACGCGTAA 

1201 + + + + + + 1260 

ATTATTATCTGTGAAGTTTCTATGAAAGTCGTTAGTACAAAGACATTTGACGTGCGCATT 

Dpnl 

Msel Sau3AI | 

BsmAI I Taal Alwl | | 

II I III 

TGTCTCATTAACAGTTGGAGCAAGTCAAGGTCATTCTGCTACCTTCTATGATCCCATACT 

12 61 + + + + + 1320 

ACAGAGTAATTGTCAACCTCGTTCAGTTCCAGTAAGACGATGGAAGATACTAGGGTATGA 

CjePI 
Msel | 

Apol | | Hpyl88IX 
Tsp509I | |Hpyl78III BsaJI | 

III I II 

ACAAAGATATACTATACAAAACTCTATCCAAAAATTTAATCCTAATCCAGAACACCTCGG 

1321 + + + + + + 1380 

TGTTTCTATATGATATGTTTTGAGATAGGTTTTTAAATTAGGATTAGGTCTTGTGGAGCC 

Hpyl78III 
Mspl ' 
BsaWI | 
BspEI | 

Mnll BciVI | j 

BseRlj CjePI Mnll j | 

II I I M 



Hpyl78III 
Taql BssSI | 

I I 



AACTATCTTGTTCTCCTCAACATATATTCCGGATACATCGACTTCTCGTGATGACTTCAT 
TTGATAGAACAAGAGGAGTTGTATATAAGGCCTATGTAGCTGAAGAGCACTACTGAAGTA 



93/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



Title: CHLAMYDIA ANTIGEMiANDZ; 
CORRESPONDING DNA FRAGMENTS 
AND USES THEREOF 
Inventor(s): Andrew D. MURDIN et al 
DOCKET NO.: 032931/0251 



PCT/CA99/00992 
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TaqI 

Rsal Dpnl 
BsrGI | Sau3AI | 

Taal j Bcefl | j 

Hpyl88IX TatI j Hpyl78IIll | j 

I II I i I I 

TTCACATTTCAGAAACCACATTGGACTGTACAACGGCACACTCGCTCTTGAAGATCGAGC 

AAGTGTAAAGTCTTTGGTGTAACCTGACATGTTGCCGTGTGAGCGAGAACTTCTAGCTCG 



MboII 

I 



Tsp509I 
Dpnl 
Bell 
Sau3AI 
Apol | 
Tsp509I j 



BsmFI 

Acelll | Alul 

Plel Hinfl Taal j CviJI 

II II I 



AGAGTGGAAAGTCTATAAATTTGATCAATTTGGTGGGACTCTACGGTTAGGCAGTAGAGC 
TCTCACCTTTCAGATATTTAAACTAGTTAAACCACCCTGAGATGCCAATCCGTCATCTCG 



MboII Msel 
RleAI | Tsp509I j 

II II 
TGTGTTTTCTACAACAGACGAAGAACAAAGTAGCAGTAGTGTGGGTTCTGTAATTAACAT 

1561 4- + + + + + 1620 

ACACAAAAGATGTTGTCTGCTTCTTGTTTCATCGTCATCACACCCAAGACATTAATTGTA 

BslI 

Msel PflMI 
Tsp509I | Mnll Alul | 

CviRI | j Ddel | CviJI j 

III II II 

CAATAATCTTGCAATTAACCTTCCCTCTATCTTAGGCAACAGAGTTGCTCCCAAGCTATG 

1621 + + + + + + 1680 

GTTATTAGAACGTTAATTGGAAGGGAGATAGAATCCGTTGTCTCAACGAGGGTTCGATAC 



Hinfl Sfcl 
Tfil RleAI | MboII 

I II I 

GATTCGCCCCACAGGTTCATCAGCACCCTATAGCGAAGATAATAACCCTATAATCAATCT 

1681 + + + + + + 1740 

CTAAGCGGGGTGTCCAAGTAGTCGTGGGATATCGCTTCTATTATTGGGATATTAGTTAGA 
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Avail 
EcoO109I 
Psp5II 
Sau96I 
Sse8647I 
Hpyl78III | 
Ddel | I 
I I 



BseMII 
CviJI 
I 



Dpnl 

BstYI | 

Sau3AI j 

Bfal | j 

Fokl J j 

Alwl j | j 

I II I 



BstAPI 
PstI | 
CviRI | j 
Sfcl | |MwoI 
I I I I 



1741 



CTCAGGACCTTTGAGCCTACTGGATGACGAGAACCTAGATCCCTATGATACTGCAGACCT 
GAGTCCTGGAAACTCGGATGACCTACTGCTCTTGGATCTAGGGATACTATGACGTCTGGA 



Aatll 

Maelll | 

Tsp45I j 

BsaHI | j 

Mnll Maellj j 
Earl | Ddel | | 
I I 



Alul 
CviJI 
I 



Msel 
Vspl 
I 



TGCCCAACCTATCGCAGAAGTTCCTCTTCTGTATCTCTTAGACGTCACAGCTAAACATAT 



ACGGGTTGGATAGCGTCTTCAAGGAGAAGACATAGAGAATCTGCAGTGTCGATTTGTATA 

Mwol 
BsaJI | 
Styl j 

BseMII Bsu36I BsmFI | | 

Tsp509I (Mnll Ddel SimI Bsbl Cvi JI | j | 

Mill ! II II 

TAATACGGATAATTTCTACCCTGAGGGTCTAAATACAACTCAACACTACGGCTACCAAGG 

1861 + + + + + + 1920 

ATTATGCCTATTAAAGATGGGACTCCCAGATTTATGTTGAGTTGTGATGCCGATGGTTCC 



NlalV 
Avail { 
Sau961 | 
Bcefl I j 

I I I 



TaqI 
Bsrl | 
Dpnl j 
Sau3AI | j 
BslI | j j Alwl 
I I II 



Mnll 

Hpyl88IX Earl | 

MboII Hpyl88IX| j 

I I II I 

CGTTTGGTCCCCTTACTGGATCGAAACAATCACAACTTCTGATACCTCTTCTGAAGATAC 

1Q?1 a. j j. i j. J- 1 Qfifl 



BstXI 
Alul j 

MboII CviJI | Sfcl 

Taal | Eco57I Cac8I | j HphI | 

III ill II 

TGTGAATACTTTACATCGCCAGCTTTATGGTGATTGGACACCTACAGGATATAAGGTAAA 

1981 + + + + + + 2040 

ACACTTATGAAATGTAGCGGTCGAAATACCACTAACCTGTGGATGTCCTATATTCCATTT 
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. 20 (con't) 



BsmAI BsrDI Mwol 

I I I 

CCCAGAAAACAAAGGAGACATTGCCCTATCTGCCTTCTGGCAATCTTTCCATAACTTATT 

2041 + + + + + + 2100 

GGGTCTTTTGTTTCCTCTGTAACGGGATAGACGGAAGACCGTTAGAAAGGTATTGAATAA 



Cjel 
Tthlllll 
CviJI | 
Hael j 
Haelll j 



Hpyl78III 

AlwNI 
Alul 
CviJI 
Mwol | 
Sfcl | | 
I I 



Plel 
Alul | 
CviJI j 
I I 



Maell 
I 

TGCGACACTACGTTATCAAACACAGCAAGGCCAAATAGCACCTACAGCTTCTGGAGAAGC 

2101 + + + + + + 2160 

ACGCTGTGATGCAATAGTTTGTGTCGTTCCGGTTTATCGTGGATGTCGAAGACCTCTTCG 



Cjel 
Hinfl | CviRI 
TaqI | | Earl | 
MboII | j |BpmI | j 
II III 



Hinfl 
Tf il 
XmnI | 
! I 



Sthl32I Alul 
Ndel | CviJI 
I I 



SfaNI CjePI 
I I 

TACTCGACTCTTCGTGCATCAAAATAGCAACAATGATGCGAAAGGATTCCATATGGAAGC 

2161 + + + + + + 2220 

ATGAGCTGAGAAGCACGTAGTTTTATCGTTGTTACTACGCTTTCCTAAGGTATACCTTCG 



Tthlllll 
TspRI [ 

CjePI Mnll | j Alul 

BscGI | Btslf j j CviJI 

II II I I I 

TACGGGTTATTCTTTGGGAACAACCTCAAACACTGCTTCTAATCATAGCTTTGGTGTAAA 

2221 + + + + + + 2280 

ATGCCCAATAAGAAACCCTTGTTGGAGTTTGTGACGAAGATTAGTATCGAAACCACATTT 



BsaJI 

BstDSI 
Tsp509I | 
Hpyl8 8IX | j 
CviJI | | | CviJI 

Pflll08I Hin4I j j |Bpll| 

I I I I I II 

CTTCTCCCAACTTTTCAGTAATCTCTACGAGAGCCACTCCGACAATTCCGTGGCTTCGCA 

2281 + + + + + + 2340 

GAAGAGGGTTGAAAAGTCATTAGAGATGCTCTCGGTGAGGCTGTTAAGGCACCGAAGCGT 
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Fig. 20 (con't) 



HaelV 
Hin4I 
Bbvl 
Dpnl 
Sau3AI j 
Hpyl78IIlj j 
Haell Mill CviRI 
Hhal | j | | j j Fnu4HI | 

Eco47IIl|| || II j Cvi JI | j Hin4I 

Mmel Taal | j j | j j j j BsaJI | j j Hinf I 
Bpml | Sf cl | III j | j j | Styl Tsel | j Tf il 

i I II III II II I I III I 

TACGACAACTGTAGCGCTCCAGATCAATAATCCTTGGCTGCAAGAGAGATTCTCTACATC 

2341 + + + + + + 2400 

ATGCTGTTGACATCGCGAGGTCTAGTTATTAGGAACCGACGTTCTCTCTAAGAGATGTAG 

Sf cl 
CviJI 
SfaNI 

Bfal | | Sfcl Hpyl78III 
Mwol j | Alul | SfaNI | 

CviRI Bpll| | j CviJI | Hpyl78III | j 

I li I I II III 

TGCATCTCTAGCCTACAGCTACAGCAACCACCATATCAAAGCATCTGGATATTCTGGAAA 

2401 + + + + + + 2460 

ACGTAGAGATCGGATGTCGATGTCGTTGGTGGTATAGTTTCGTAGACCTATAAGACCTTT 



CviJI 
Fnu4HI | 
Taul | 

Rsal Acil| | 

I II I 

AATACAAACGGAAGGCAAATGTTATAGTACGACATTAGGGGCGGCTCTCTCTTGCTCTCT 

2461 + + + + + + 2520 

TTATGTTTGCCTTCCGTTTACAATATCATGCTGTAATCCCCGCCGAGAGAGAACGAGAGA 



BsaXI 



Hpyl78III 
Dpnl | 
Sau3AI | | 
I I 



| Muni 
| Mill I Bcefl Tsp509I 

I I I I 

ATCTCTACAATGGCGATCACGACCTCTCCACTTCACTCCTTTTATCCAAGCAATTGCCGT 

2521 + + + + + + 2580 

TAGAGATGTTACCGCTAGTGCTGGAGAGGTGAAGTGAGGAAAATAGGTTCGTTAACGGCA 
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Tthlllll 
I 



Hpyl78III 



Fig. 20 (con't) 

Xmnl 
Apol 
Tsp509I 
Bf al j 
Alul | j 
Cvi JI j j 
I I i 

^TCGTTCTAATCAAACTGCGTTTCAAGAAAGTGGAGATAAAGCTAGAAAATTTTCTGTTCA 

2581 + + + + + + 2640 

AGCAAGATTAGTTTGACGCAAAGTTCTTTCACCTCTATTTCGATCTTTTAAAAGACAAGT 

Hinf I 
Bsp24I 
CjePI 
Cjel 
Haell 
Hhal 
Eco47III 
Hpyl8 8IX 
Mnll | 

Apol | j | | | | | Bbsl 

Hin4I EcoRI j j j j j j j MboII 

BsmFI Taal | Tsp509I j | | j j j Tf ±1 Alol | 

I II III I I I I I II 

TAAACCCTTATATAACCTGACAGTCCCTCTGGGAATTCAGAGCGCTTGGGAATCCAAGTT 

2641 + + + + + + 2700 

ATTTGGGAATATATTGGACTGTCAGGGAGACCCTTAAGTCTCGCGAACCCTTAGGTTCAA 



Cjel Cac8I 
CjePI Alul | Mnll 

Bsp24I| CviJI j CviJI BseMII | 

II II I I I 

CCGTCTTCCTACCTATTGGAACATAGAGCTTGCTTATCAGCCTGTCCTCTACCAACAAAA 

2701 + + + + + + 2760 

GGCAGAAGGATGGATAACCTTGTATCTCGAACGAATAGTCGGACAGGAGATGGTTGTTTT 
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Fig. 20 (con't) 



Dpnl 
Sau3AI 
Bfal 



Plel 
Hinf I 
Tf il 
Hpyl78III 
Bfal 
Xbal 
Hinf I 
Maell 



Dpnl 
Sau3AI | 
Ddel | j 
Hpyl78III j j 



Drdll 
NlalV 
Hpyl78III 
NlalV 
Cvi JI | 
Nlalll | j 
Alwl | | jcjePI 
I Ml 



TCCTGAGATCAACGTGAGTCTAGAATCTAGTGGATCGTCATGGCTCCTATCAGGAACCAC 

2761 + + + + + + 2820 

AGGACTCTAGTTGCACTCAGATCTTAGATCACCTAGCAGTACCGAGGATAGTCCTTGGTG 



Mboll 
MboII | 
A P Ol | | 
Tsp509l| | 
II I 

CCTTGCTCGCAATGCCATTGCTTTTAAAGGAAGAAACCAAATTTTTATCTTCCCTAAACT 

2821 + + + + + + 2880 

GGAACGAGCGTTACGGTAACGAAAATTTCCTTCTTTGGTTTAAAAATAGAAGGGATTTGA 



BsrDI 
BsrDl| Dral 
Mwol | j Msel | 
Cac8I | j | CjePI | j 
I I il I II 



Mnll 

Ddel CviJI BciVI | 

I I II 

TTCGGTGTTCTTAGACTATCAAGGCTCGGTATCCTCATCAACGACGACACATTACCTTCA 

2881 + + + + + + 2940 

AAGCCACAAGAATCTGATAGTTCCGAGCCATAGGAGTAGTTGCTGCTGTGTAATGGAAGT 

Dral Nlalll 
Msel Msel | Nspl CviRI 

III I I 

CG C AG GAACG AC CTTTAAGTTT TAAAAG C ATGTTAT ATAG AC AATG C AAC C TGT AAAG AC 

2941 + h + + + + 3000 

GCGTCCTTGCTGGAAATTCAAAATTTTCGTACAATATATCTGTTACGTTGGACATTTCTG 



CjePI 
I 



Hinf I 
Nlalll 
Tf il 
Hpyl78III | 
Real | j 
Bed | | | 
MM 



CjePI 
BpulOI | 
Ddel j 
Alul | j 
CviJI j | 
I I I 



CAAATAGAGAGTAGTGAACACTCTCTACCATCATGAATCTTATGGGAGAAGCTAAGGGAA 

3001 + + + + + + 3060 

GTTTATCTCTCATCACTTGTGAGAGATGGTAGTACTTAGAATACCCTCTTCGATTCCCTT 
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NspV 
TaqI 

Fokl Hinfl | 

Msel Hin4I Mill I j 

Mae I I Tsp509I | Sthl32I Bfal Tfil | 

I II I III 

ATC C AC AG ATACG TTTC C C C C ATAAAAATT AAG AAC C CG AT AC AT C CTC AC TAG AG ATT C 

3061 + + + + + + 3120 

TAGGTGTCTATGCAAAGGGGGTATTTTTAATTCTTGGGCTATGTAGGAGTGATCTCTAAG 



BsmI 
BpulOI | 
Msel Ddel j TaqI 

I III 
G AAAG AAC TACTTAAATC CTAAG C ATT C G A 

3121 + + + 3150 

CTTTCTTGATGAATTTAGGATTCGTAAGCT 
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Figure 21 A: CPN1 00626 Coding Sequence 



tcctgaactc cactcgaaat tactgattag 
tgtgacctac aatgctttag ggatcaaagt 
cactctctcc ttagattact ctgcggatat 
cgtggcgagt agaatgagat ttttaacaat 
tctagtgtca aagactcctc ctaagttttt 
gttcgggatg actcctgcag tgtatagttt 
agagagggat gaagagtttc gtacgagctt 
aggattttct ccaataacta cgtttgttgg 
actttctaac tacaagtcta ttgataacat 
tgtgtcctgt aataatttct tattatcaaa 
tctcgcgatt gggactggag gcgcgattgc 
tagaggaccc cttatttttt tcagcaatcg 
aactcgtggg ggtgcgattg cctgtaatgg 
tttctacttt gtcaacaatt ccgtcaacaa 
ctgccgcatc caaagcaaca gggcacctct 
agggggtgcg cttcgtagtg aaaatacaac 
taagaacaac tgtgggaaca atggcggggc 
aaataactcc gggtcggtga ttttcaataa 
aggaaatggt tcaggagggg cgatttatac 
aactattctt ttcaataata actactgcat 
tttgacaatc aaaaatagtg gccacgtata 
tgctcttatg ctcctacagg acagcacctg 
atttcaaaat aatgaggttt tcctcaccac 
accaaatagc aacttacaac ttggagctaa 
tatagaacac caacatccaa ctacaaatcc 
gggaacgatc ttattttctt cagcctatat 
cattagcagc tcgaaaaata cctctgaact 
tgcgggatgg caattctata agttcactca 
ggcgagtatt gcaacaactg ccaactctga 
catcattaat aaccttgcga ttaacctccc 
cttgtggatc cgtcctctac aatctagtgc 
tactttatca ggtcctctga cactcttaaa 
agatctctct gagcctttac aaaacattca 
tcatatcaat accgataact ttcatcctga 
tcaaggcatc tggtctcctt attgggtaga 
agagacggca aacaccctct acagagctct 
ggtcaatcct gaataccaag gagatcttgc 
tatgttctct ctattaagaa gttataatcg 
cttagaaatt caagggattg ccgacggcct 
tccaggattc cgtatccaat ctacagggta 
acatcagaaa atctccttag gtcttgcaca 
aagcaacaac gtctcggctc acaatacagt 
ccaagaggcc tttgcaacat cccacagctt 
cgcctacatc cgtcacatca agaacagggc 
agcagctatc ggctgttctt tcccttggca 
cgttcaggca attgcaatac gttctcacca 
ccgaaagttt gtctctcaaa agcctttcta 
aaaatggcag tcaaaattcc acgtacctac 
ggtactctat caacaaaatc cccaaatcgg 
ggatatccta ggccataact atgttcgcaa 
tgcgctcttc cgttctctcg atctattctt 
atctacgcac catctccaag caggaagtac 
attgaaatct ttagaattaa caactatccg 
ctccctcaaa atttaaataa 



ccaaggtacg tggacgacgc aggccactcc 60 
gaaaaatacc atgcaggtgt ttcctaaagt 12 0 
ttcttcctcc acgctgagtc actacttaaa 180 
aagtgaccaa aacagaaaga ttaaggaacc 24 0 
attctatctc gggaatttca cagcctgcat 300 
acaaacggac tcccttgaaa agtttgcttt 360 
ccctctctta gactctctct ccactcttac 420 
aaatagacat aattcctctc aagacattgt 480 
ccttcttctt tggacatcgg ctgggggagc 54 0 
tgttgaagac catgccttct tcagtaaaaa 600 
ttgccaggga gcctgcacaa tcacgaagaa 660 
aggtcttaac aatgcgagta caggaggaga 72 0 
agacttcacg atttctcaaa atcaagggac 780 
ctggggagga gccctctcca ccaatggaca 840 
actctttttt aacaatacag cccctagtgg 900 
gatctctgat aacacgcgtc ctatttattt 960 
cattcaaaca agcgttactg ttgcgataaa 1020 
caacacagcg ttatctggtt cgataaattc 1080 
aacaaaccta tccatagacg ataaccctgg 1140 
tcgcgatggc ggagctatct gtacacaatt 12 00 
tttcaccaac aatcaaggaa actggggagg 1260 
cctactcttc gcggaacaag gaaatatcgc 1320 
atttggtaga tacaacgcca tacattgtac 13 80 
taaggggtat acgactgctt tttttgatcc 1440 
tctaatcttt aatcccaatg cgaaccatca 1500 
cccagaagct tctgactacg aaaataattt 1560 
tcgcaatggt gtcctctcta tcgaggatcg 162 0 
aaaaggaggt atccttaaat tagggcatgc 1680 
gactccatca actagtgtag gctcccaggt 1740 
ctcgatctta gcaaaaggaa aagctcctac 1800 
tcctttcaca gaggacaata accctacaat 1860 
tgaggaaaac cgcgatccct acgacagtat 1920 
tcttctttct ttatcggatg taacagcacg 1980 
aagcttaaat gcgactgagc attacggtta 2 040 
gacgataaca acaacaaata acgcttctat 2100 
gtatgccaat tggactccct taggatataa 2160 
tacgactccc ctatggcaat cctttcatac 2220 
aactggtgat tctgatatcg agaggccttt 2280 
ctttgttcat caaaatagca tccccggggc 2340 
ttccttacaa gcatcctccg aaacttcttt 2400 
gttcttcacc cgcactaaag aaatcggatc 2460 
ctcttcactt tatgttgagc ttccgtggtt 2 520 
agcgtatggc tatggggacc atcacctcca 2580 
agaagggacg tgttatagcc atacattagc 2 640 
acagaaatcc tatcttcacc tcagcccgtt 2700 
aacagcgttc gaagagattg gtgacaatcc 2 760 
taatctgacc ttacctctag gaatccaagg 2820 
agaatggact ctagaacttt cttaccaacc 2880 
tgtcacgcta cttgcgagcg gaggttcctg 2940 
tgctttaggg tacaaagtcc acaatcaaac 3000 
ggattaccaa ggatcggtct cctcctcgac 3060 
cttaaaattc taaaataaaa gaacgataaa 3120 
atgagctacg ttagcccaat cggtagagga 3180 

3200 
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Figure 21 B: CPN 100626 Deduced Amino Acid Sequence 

Met Gin Val Phe Pro Lys Val Thr Leu Ser Leu Asp Tyr Ser Ala Asp 
15 10 15 

lie Ser Ser Ser Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met 
20 25 30 

Arg Phe Leu Thr lie Ser Asp Gin Asn Arg Lys lie Lys Glu Pro Leu 
35 40 45 

Val Ser Lys Thr Pro Pro Lys Phe Leu Phe Tyr Leu Gly Asn Phe Thr 
50 55 60 

Ala Cys Met Phe Gly Met Thr Pro Ala Val Tyr Ser Leu Gin Thr Asp 



Ser Leu Glu Lys Phe Ala Leu Glu Arg Asp Glu Glu Phe Arg Thr Ser 
85 90 95 

Phe Pro Leu Leu Asp Ser Leu Ser Thr Leu Thr Gly Phe Ser Pro lie 
100 105 110 

Thr Thr Phe Val Gly Asn Arg His Asn Ser Ser Gin Asp lie Val Leu 
115 120 125 

Ser Asn Tyr Lys Ser lie Asp Asn lie Leu Leu Leu Trp Thr Ser Ala 
130 135 140 

Gly Gly Ala Val Ser Cys Asn Asn Phe Leu Leu Ser Asn Val Glu Asp 
145 150 155 160 

His Ala Phe Phe Ser Lys Asn Leu Ala lie Gly Thr Gly Gly Ala lie 
165 170 175 

Ala Cys Gin Gly Ala Cys Thr lie Thr Lys Asn Arg Gly Pro Leu lie 
180 185 190 

Phe Phe Ser Asn Arg Gly Leu Asn Asn Ala Ser Thr Gly Gly Glu Thr 
195 200 205 

Arg Gly Gly Ala lie Ala Cys Asn Gly Asp Phe Thr lie Ser Gin Asn 
210 215 220 

Gin Gly Thr Phe Tyr Phe Val Asn Asn Ser Val Asn Asn Trp Gly Gly 
225 230 235 240 

Ala Leu Ser Thr Asn Gly His Cys Arg lie Gin Ser Asn Arg Ala Pro 
245 250 255 

Leu Leu Phe Phe Asn Asn Thr Ala Pro Ser Gly Gly Gly Ala Leu Arg 
260 265 270 

Ser Glu Asn Thr Thr lie Ser Asp Asn Thr Arg Pro lie Tyr Phe Lys 
275 280 285 
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Fig. 21 B (con't) 

Asn Asn Cys Gly Asn Asn Gly Gly Ala lie Gin Thr Ser Val Thr Val 
290 295 300 

Ala lie Lys Asn Asn Ser Gly Ser Val lie Phe Asn Asn Asn Thr Ala 
305 310 315 320 

Leu Ser Gly Ser lie Asn Ser Gly Asn Gly Ser Gly Gly Ala lie Tyr 
325 330 335 

Thr Thr Asn Leu Ser lie Asp Asp Asn Pro Gly Thr lie Leu Phe Asn 
340 345 350 

Asn Asn Tyr Cys He Arg Asp Gly Gly Ala He Cys Thr Gin Phe Leu 
355 360 365 

Thr He Lys Asn Ser Gly His Val Tyr Phe Thr Asn Asn Gin Gly Asn 
370 375 380 

Trp Gly Gly Ala Leu Met Leu Leu Gin Asp Ser Thr Cys Leu Leu Phe 
385 390 395 400 

Ala Glu Gin Gly Asn He Ala Phe Gin Asn Asn Glu Val Phe Leu Thr 
405 410 415 

Thr Phe Gly Arg Tyr Asn Ala He His Cys Thr Pro Asn Ser Asn Leu 
420 425 430 

Gin Leu Gly Ala Asn Lys Gly Tyr Thr Thr Ala Phe Phe Asp Pro lie 
435 440 445 

Glu His Gin His Pro Thr Thr Asn Pro Leu He Phe Asn Pro Asn Ala 
450 455 460 

Asn His Gin Gly Thr He Leu Phe Ser Ser Ala Tyr He Pro Glu Ala 
465 470 475 480 

Ser Asp Tyr Glu Asn Asn Phe He Ser Ser Ser Lys Asn Thr Ser Glu 
485 490 495 

Leu Arg Asn Gly Val Leu Ser lie Glu Asp Arg Ala Gly Trp Gin Phe 
500 505 510 

Tyr Lys Phe Thr Gin Lys Gly Gly He Leu Lys Leu Gly His Ala Ala 
515 520 525 

Ser He Ala Thr Thr Ala Asn Ser Glu Thr Pro Ser Thr Ser Val Gly 
530 535 540 

Ser Gin Val He He Asn Asn Leu Ala He Asn Leu Pro Ser He Leu 
545 550 555 560 

Ala Lys Gly Lys Ala Pro Thr Leu Trp He Arg Pro Leu Gin Ser Ser 



565 



570 



575 
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Ala Pro Phe Thr Glu Asp Asn Asn Pro Thr lie Thr Leu Ser Gly Pro 
580 585 590 

Leu Thr Leu Leu Asn Glu Glu Asn Arg Asp Pro Tyr Asp Ser lie Asp 
595 600 605 

Leu Ser Glu Pro Leu Gin Asn lie His Leu Leu Ser Leu Ser Asp Val 
610 615 620 

Thr Ala Arg His lie Asn Thr Asp Asn Phe His Pro Glu Ser Leu Asn 
625 630 635 640 

Ala Thr Glu His Tyr Gly Tyr Gin Gly lie Trp Ser Pro Tyr Trp Val 
645 650 655 

Glu Thr lie Thr Thr Thr Asn Asn Ala Ser He Glu Thr Ala Asn Thr 
660 665 670 

Leu Tyr Arg Ala Leu Tyr Ala Asn Trp Thr Pro Leu Gly Tyr Lys Val 
675 680 685 

Asn Pro Glu Tyr Gin Gly Asp Leu Ala Thr Thr Pro Leu Trp Gin Ser 
690 695 700 

Phe His Thr Met Phe Ser Leu Leu Arg Ser Tyr Asn Arg Thr Gly Asp 
705 710 715 720 

Ser Asp He Glu Arg Pro Phe Leu Glu He Gin Gly He Ala Asp Gly 
725 730 735 

Leu Phe Val His Gin Asn Ser He Pro Gly Ala Pro Gly Phe Arg He 
740 745 750 

Gin Ser Thr Gly Tyr Ser Leu Gin Ala Ser Ser Glu Thr Ser Leu His 
755 760 765 

Gin Lys He Ser Leu Gly Phe Ala Gin Phe Phe Thr Arg Thr Lys Glu 
770 775 780 

He Gly Ser Ser Asn Asn Val Ser Ala His Asn Thr Val Ser Ser Leu 
785 790 795 800 

Tyr Val Glu Leu Pro Trp Phe Gin Glu Ala Phe Ala Thr Ser His Ser 
805 810 815 

Leu Ala Tyr Gly Tyr Gly Asp His His Leu His Ala Tyr He Arg His 
820 825 830 

He Lys Asn Arg Ala Glu Gly Thr Cys Tyr Ser His Thr Leu Ala Ala 
835 840 845 

Ala He Gly Cys Ser Phe Pro Trp Gin Gin Lys Ser Tyr Leu His Leu 
850 855 860 
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Ser Pro Phe Val Gin Ala He Ala He Arg Ser His Gin Thr Ala Phe 
865 870 875 880 

Glu Glu He Gly Asp Asn Pro Arg Lys Phe Val Ser Gin Lys Pro Phe 
885 890 895 

Tyr Asn Leu Thr Leu Pro Leu Gly He Gin Gly Lys Trp Gin Ser Lys 
900 905 910 

Phe His Val Pro Thr Glu Trp Thr Leu Glu Leu Ser Tyr Gin Pro Val 
915 920 925 

Leu Tyr Gin Gin Asn Pro Gin He Gly Val Thr Leu Leu Ala Ser Gly 
930 935 940 

Gly Ser Trp Asp He Leu Gly His Asn Tyr Val Arg Asn Ala Leu Gly 
945 950 955 960 

Tyr Lys Val His Asn Gin Thr Ala Leu Phe Arg Ser Leu Asp Leu Phe 
965 970 975 

Leu Asp Tyr Gin Gly Ser Val Ser Ser Ser Thr Ser Thr His His Leu 
980 985 990 

Gin Ala Gly Ser Thr Leu Lys Phe 
995 1000 
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Figure 22 (RY-45) 

Restriction enzyme analysis of CPN1 00626 



Tsp509I 
TaqI | Cjel 
I I I 



BsaJI 
Styl 
CviJI | 
I I 



BsaAI 
Maell | 
Rsal | j 



MslI 
Cjel 
Hgal 



CviJI 
Hael 
Haelll 
Cac8I | 
I 



TCCTGAACTCCACTCGAAATTACTGATTAGCCAAGGTACGTGGACGACGCAGGCCACTCC 

1 + + + + + + 

AGGACTTGAGGTGAGCTTTAATGACTAATCGGTTCCATGCACCTGCTGCGTCCGGTGAGG 

Aarl 
MslI | 

Mae I I I Dpnl BspMI CviRI | j Maelll 

Tsp45I Sau3AI | Alwl j Nlalll j j Tsp45I 

I II I I II I I 

TGTGACCTACAATGCTTTAGGGATCAAAGTGAAAAATACCATGCAGGTGTTTCCTAAAGT 



61 



ACACTGGATGTTACGAAATCCCTAGTTTCACTTTTTATGGTACGTCCACAAAGGATTTCA 



BseMII 
I 



Bsp24I 
MboII CjePI | 
Ddel Acil | Cjel | j 

I II II I 

CACTCTCTCCTTAGATTACTCTGCGGATATTTCTTCCTCCACGCTGAGTCACTACTTAAA 

121 + + + + + + 180 

GTGAGAGAGGAATCTAATGAGACGCCTATAAAGAAGGAGGTGCGACTCAGTGATGAATTT 



Maelll 
Tsp45I 
Hinf I | 
Mnll j 
Ddel | j 
II 



Maell 
Msel | 
Plel | j 
I I I 



Cjel 

CjePI Maelll 
Bsp24l| Msel Tsp45I Msel NlalV 

II II II 

CGTGGCGAGTAGAATGAGATTTTTAACAATAAGTGACCAAAACAGAAAGATTAAGGAACC 

181 + + + + + + 240 

GCACCGCTCATCTTACTCTAAAAATTGTTATTCACTGGTTTTGTCTTTCTAATTCCTTGG 



Hinf I 
Mnll 
Plel | 
BseRl) j 
Bf al | | | 
II I 



Apol CviRI 
Tsp509I Cac8I | 

Sthl32I Hpyl78III | Sthl32I j 

Ddel Mnll | Aval | | CviJI | j 

I II III III 

TCTAGTGTCAAAGACTCCTCCTAAGTTTTTATTCTATCTCGGGAATTTCACAGCCTGCAT 

241 + + + + + + 300 

AGATCACAGTTTCTGAGGAGGATTCAAAAATAAGATAGAGCCCTTAAAGTGTCGGACGTA 
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Fig. 22 (con't) 



Hpyl78III 
Plel | 
Nlalll | j 
Nspl j j 
I 



BtSl 
Fokl 
PstI 
CviRI j 
Hinfl Sfcl j |TspRI 
I III I 



Plel Hinfl 
I 



Mnll 
I 



301 



GTTCGGGATGACTCCTGCAGTGTATAGTTTACAAACGGACTCCCTTGAAAAGTTTGCTTT 
CAAGCCCTACTGAGGACGTCACATATCAAATGTTTGCCTGAGGGAACTTTTCAAACGAAA 



Hin4I 
Alul 
CviJI 
MboII | 
Rsal | j 
Fokl | j j 
Xmnl Sunl | j | 
I I II I 



BplI 
BsaXI 
Hin4I j 
Mnll | j 
Hinfl | j j 
Ddel || || 
Plel | II || 
II ' ■ 



AGAGAGGGATGAAGAGTTTCGTACGAGCTTTCCTCTCTTAGACTCTCTCTCCACTCTTAC 

361 + + + + + + 420 

TCTCTCCCTACTTCTCAAAGCATGCTCGAAAGGAGAGAATCTGAGAGAGAGGTGAGAATG 



Hpyl78III Rsal 
Cjel Tsp509I Smll | TatI | 

Mmel | Maell Bce83I |CjeI | |MnlI | j 

II I I I I I I I I I 

AGGATTTTCTCCAATAACTACGTTTGTTGGAAATAGACATAATTCCTCTCAAGACATTGT 

421 + + + + + + 480 

TCCTAAAAGAGGTTATTGATGCAAACAACCTTTATCTGTATTAAGGAGAGTTCTGTAACA 

Bsp24I Alul 
Cjel CviJI CviJI 

Fokl MboII CjePI Acelll j Mwol | 

I II II II 

ACTTTCTAACTACAAGTCTATTGATAACATCCTTCTTCTTTGGACATCGGCTGGGGGAGC 

481 + + + + + + 540 

TGAAAGATTGATGTTCAGATAACTATTGTAGGAAGAAGAAACCTGTAGCCGACCCCCTCG 

Tsp509I MboII 
Cjel | Nlalll | 

CjePI | j Bbsl | j 

Bsp24l|| | EC057I MboII | [ j CjePI 

M! I I I I I I I 

TGTGTCCTGTAATAATTTCTTATTATCAAATGTTGAAGACCATGCCTTCTTCAGTAAAAA 
541 + + + + + + 600 

ACACAGGACATTATTAAAGAATAATAGTTTACAACTTCTGGTACGGAAGAAGTCATTTTT 



107/165 
SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



Title: CHLAMYDIA ANTIGENS 1 AND*"" 
CORRESPONDING DNA FRAGMENTS 
AND USES THEREOF 
Inventor(s): Andrew D. MURDIN et al 
DOCKET NO.: 032931/0251 



PCT/CA99/00992 



Fig. 22 (con't) 



Mnll 
Hpyl78III 
CviRI 
Cac 
CviJl 
NlalV 
Bpml 
ScrFI ' 
BsaJI | 
EcoRII | 
Cac8I 
CjePI 
Bsgl 

Hpyl78III BsrI BsmFI | 

Nrul Tthlllll |HhaI | j 

Thai Mnll | j Thai | I 

I 1 II Ml 

TCTCGCGATTGGGACTGGAGGCGCGATTGCTTGCCAGGGAGCCTGCACAATCACGAAGAA 

601 + + + + + + 660 

AGAGCGCTAACCCTGACCTCCGCGCTAACGAACGGTCCCTCGGACGTGTTAGTGCTTCTT 



MboII 
NlalV 
Avail 
EcoC-1091 
Psp5II 
Sau96I 
SimI 
I 



Mnll Taql 

I I 



Rsal 
Mnll | 
Tat I j 

I I 



661 + + + + + + 720 

ATCTCCTGGGGAATAAAAAAAGTCGTTAGCTCCAGAATTGTTACGCTCATGTCCTCCTCT 

BssSI BseRI BsmAI Hpyl78III 

II I I 

AACTCGTGGGGGTGCGATTGCCTGTAATGGAGACTTCACGATTTCTCAAAATCAAGGGAC 

721 + -t- + + + + 780 

TTGAGCACCCCCACGCTAACGGACATTACCTCTGAAGTGCTAAAGAGTTTTAGTTCCCTG 



Banll 
Bspl286I 
BsrI | 
Bmrl | CviJI j Fokl 
Mnll | |Hin4l| j Mnll 
Hindi | j | NlalV j j BseRI | BplI 
I I I I Ml II I 

TTTCTACTTTGTCAACAATTCCGTCAACAACTGGGGAGGAGCCCTCTCCACCAATGGACA 

781 + + + + + + 840 

AAAGATGAAACAGTTGTTAAGGCAGTTGTTGACCCCTCCTCGGGAGAGGTGGTTACCTGT 



Tsp509I 

Hindi | 
BsmFI | | 
I I I 
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TspRI 
Acil | 
Fnu4HI j 
Taul j 
BtsI | j 
I I I 



SfaNI 
Mwol | 
I I 
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Bspl286I 
NlalV 
Bmgl | 
BseSI | 
BanI | j 
III 



Msel 
Mnll | 
I I 
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BslI 
EcoNI 
Bf al | 
Mnll | j 
CviJI | j I 
I I I I 



CTGCCGCATCCAAAGCAACAGGGCACCTCTACTCTTTTTTAACAATACAGCCCCTAGTGG 

841 + + + + + + 900 

GACGGCGTAGGTTTCGTTGTCCCGTGGAGATGAGAAAAAATTGTTATGTCGGGGATCACC 



Pf 111081 
Hhal I 



Hpyl88IX 
Hgal| 
Dpnl j | 
Sau3AI | j j 

! I 



Thai 
Afllll | 
Mlul j 
I 



CjePI Msel 
I I 



AGGGGGTGCGCTTCGTAGTGAAAATACAACGATCTCTGATAACACGCGTCCTATTTATTT 

901 + + + + + + 960 

TCCCCCACGCGAAGCATCACTTTTATGTTGCTAGAGACTATTGTGCGCAGGATAAATAAA 



CviJI 
Haelll 
CjePI | 
Faul Nlaivj 
Sthl32l| Sau96I | j Tthlllll 

Taal | j Acil | j j Maelll Taal | 

I II I III III 

TAAGAACAACTGTGGGAACAATGGCGGGGCCATTCAAACAAGCGTTACTGTTGCGATAAA 

961 + + + + + + 1020 

ATTCTTGTTGACACCCTTGTTACCGCCCCGGTAAGTTTGTTCGCAATGACAACGCTATTT 



RleAI 
I 



SimI 
Neil | 
ScrFI j 
Mspl | j 

II ■ 



Apol 
Tsp5 0 9I 
TaqI I 
Drdll | | 
I 



AAATAACTCCGGGTCGGTGATTTTCAATAACAACACAGCGTTATCTGGTTCGATAAATTC 



TTTATTGAGGCCCAGCCACTAAAAGTTATTGTTGTGTCGCAATAGACCAAGCTATTTAAG 



Hpyl78III 
Drdll j 
Mnll | j 
Xranl | j j 
I I I 



ScrFI 
BsaJI | 
EcoRII j 
I I 



AGGAAATGGTTCAGGAGGGGCGATTTATACAACAAACCTATCCATAGACGATAACCCTGG 



TCCTTTACCAAGTCCTCCCCGCTAAATATGTTGTTTGGATAGGTATCTGCTATTGGGACC 
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Alul 
CviJI 
Ecil 
Acil | 
Bed 

Hpyl78III | 
BsmI Nrul j 
CviRI Thai j 

I I I 

AACTATTCTTTTCAATAATAACTACTGCATTCGCGATGGCGGAGCTATCTGTACACAATT 

1141 + + + + + + 1200 

TTGATAAGAAAAGTTATTATTGATGACGTAAGCGCTACCGCCTCGATAGACATGTGTTAA 



Tsp509I 
Rsal | 
BsrGI | j 
TatI 



BsaAI 
HphI 
Maell 
CviJI 
Hael 
Haelll 
Mscl 
Eael | 
I I 



BsrI 
Bmrl | 
Mnll | | 
I I I 



TTTGACAATCAAAAATAGTGGCCACGTATATTTCACCAACAATCAAGGAAACTGGGGAGG 

1201 + + + + + + 1260 

AAACTGTTAGTTTTTATCACCGGTGCATATAAAGTGGTTGTTAGTTCCTTTGACCCCTCC 



BsiHKAI 
Bspl286I 



MboII 
Aarl | 
AlwNI j 



Acil 
Earl 
BspMI Thai 



Sfcl 

t 

TGCTCTTATGCTCCTACAGGACAGCACCTGCCTACTCTTCGCGGAACAAGGAAATATCGC 

1261 + + + + + + 1320 

ACGAGAATACGAGGATGTCCTGTCGTGGACGGATGAGAAGCGCCTTGTTCCTTTATAGCG 



Rsal 

Cjel BsrGI | 

Cjel | Mnll Cjel j | 

Mnll HphI | HgiEIlj Cjel Cjel jTatl j 

I II III I I I I 

ATTTCAAAATAATGAGGTTTTCCTCACCACATTTGGTAGATACAACGCCATACATTGTAC 

1321 + + + + + + 1380 

TAAAGTTTTATTACTCCAAAAGGAGTGGTGTAAACCATCTATGTTGCGGTATGTAACATG 

Fokl 

Sf Cl 
Dpnl | 

Alul BstZ17I Sau3AI | j 

Cjel CviJI AccI | Alwl | | j 

I I II I I I I 

ACCAAATAGCAACTTACAACTTGGAGCTAATAAGGGGTATACGACTGCTTTTTTTGATCC 



TGGTTTATCGTTGAATGTTGAACCTCGATTATTCCCCATATGCTGACGAAAAAAACTAGG 
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Mmel 
Mnll | 
Msel I 

I I 



Bed 
Drdll | 

I I 



TATAGAACACCAACATCCAACTACAAATCCTCTAATCTTTAATCCCAATGCGAACCATCA 

1441 + + + + + + : 

ATATCTTGTGGTTGTAGGTTGATGTTTAGGAGATTAGAAATTAGGGTTACGCTTGGTAGT 

Pf 111081 

MboII Hpyl8 8IX 

Dpnl | Alul | 

Sau3AI | j CviJI j 

Eco57I | \ | CviJI Hindlll | j |Tsp509I 

I I II I I I I I ! 

GGGAACGATCTTATTTTCTTCAGCCTATATCCCAGAAGCTTCTGACTACGAAAATAATTT 

1501 + + + + + + 1560 

CCCTTGCTAGAATAAAAGAAGTCGGATATAGGGTCTTCGAAGACTGATGCTTTTATTAAA 



TaqI 
Alul 
CviJI 
Fnu4HI | 
Tsel| | 
I I 



Dpnl 
Faul 
Sau3AI 

Hpyl88IX Sthl32l| 
Acelll | Bcgl Mnll | j 

Bbvl | j Mnll BsrDI | Mnll TaqI jjj 

II I I II I i III 

CATTAGCAGCTCGAAAAATACCTCTGAACTTCGCAATGGTGTCCTCTCTATCGAGGATCG 

1561 + + 1- + + + 1620 

GTAATCGTCGAGCTTTTTATGGAGACTTGAAGCGTTACCACAGGAGAGATAGCTCCTAGC 

Fnu4HI 
Taul 
Acil | 
Nlalll | 
BciVI Nspl j 

Tsp5 09I | SphI j 

Mnll Msel | j Cac8I | j 

I I I I I 



Alwl Tsp509I Fokl 
Acil | Bed | Bcgl | 
II II'" 



TGCGGGATGGCAATTCTATAAGTTCACTCAAAAAGGAGGTATCCTTAAATTAGGGCATGC 
ACGCCCTACCGTTAAGATATTCAAGTGAGTTTTTCCTCCATAGGAATTTAATCCCGTACG 



Bed 
Hin4I | 

Hinfl | | ScrFI 

BseMII Ddel | j j BsaJI 

BstAPI | Hpyl88IX | j j EcoRII 

CviRI | | Plel | III Bfal NlalV j 

Mwol | Mwol j BsmAI || III Spel | CviJI | j 

II I I I I I I I I II II I 

GGCGAGTATTGCAACAACTGCCAACTCTGAGACTCCATCAACTAGTGTAGGCTCCCAGGT 

1681 + + + + + + 1740 

CCGCTCATAACGTTGTTGACGGTTGAGACTCTGAGGTAGTTGATCACATCCGAGGGTCCA 
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Msel 
Vspl 
I 



Mnll 
Ddel 
Dpnl 
Mnll 
Sau3AI | 
TaqI | | 
I I 



Aim 
CviJI 

I 



Msel 

I 

CATCATTAATAACCTTGCGATTAACCTCCCCTCGATCTTAGCAAAAGGAAAAGCTCCTAC 

1741 + + + + + + 1800 

GTAGTAATTATTGGAACGCTAATTGGAGGGGAGCTAGAATCGTTTTCCTTTTCGAGGATG 



Alwl 
Hin4I 
Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 



Mnll 
BsiHKAI 
Bspl286I 
Bfal | 
Mnll I 



TspS09I 
I 

CTTGTGGATCCGTCCTCTACAATCTAGTGCTCCTTTCACAGAGGACAATAACCCTACAAT 



GAACACCTAGGCAGGAGATGTTAGATCACGAGGAAAGTGTCTCCTGTTATTGGGATGTTA 



Hpyl8 8IX 
Avail 
EcoO-1091 
PspSII 
Sau96I 
Sse8647I 
I 



Msel 
Mnll | 
Mnll | j 
III 



Pf 111081 
Dpnl 
Sau3AI 
Thai | 
Acil | j 
Alwl | || 
I I II 



BseMII 
Taal | 
I I 



TACTTTATCAGGTCCTCTGACACTCTTAAATGAGGAAAACCGCGATCCCTACGACAGTAT 

1861 + + + + + + 1920 

ATGAAATAGTCCAGGAGACTGTGAGAATTTACTCCTTTTGGCGCTAGGGATGCTGTCATA 



CviJI 
Ddel 
Hpyl8 8IX 
Dpnl 
Bglll | 
BstYI | 
Sau3AI j 
I I 



Maelll 
Hpyl88IX | 
I I 



Fokl 
Maell | 
I I 



1921 



AGATCTCTCTGAGCCTTTACAAAACATTCATCTTCTTTCTTTATCGGATGTAACAGCACG 
TCTAGAGAGACTCGGAAATGTTTTGTAAGTAGAAGAAAGAAATAGCCTACATTGTCGTGC 
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BseMII 
Msel | 
Alul 
CviJI 
Hindlll | 

Pokl Hpyl78III MM Ddel Taal 

I I I I II I I 

TCATATCAATACCGATAACTTTCATCCTGAAAGCTTAAATGCGACTGAGCATTACGGTTA 

1981 + + + + + + 2040 

AGTATAGTTATGGCTATTGAAAGTAGGACTTTCGAATTTACGCTGACTCGTAATGCCAAT 



BsmAI 

Bsal | Sfcl 

BsmAI j BsmAI | 

SfaNI jBsmBI BsmBlj 
III II 
T C AAGG CAT CTGGTCTC CTT ATTGGGTAGAG AC G ATAAC AAC AAC AAATAAC G CTT C TAT 

2041 + + + + + + 2100 

AGTTCCGTAGACCAGAGGAATAACCCATCTCTGCTATTGTTGTTGTTTATTGCGAAGATA 



Ban 1 1 
BsiHKAI 
Bspl28SI 
CjePI 
Sad 
Alul 
CviJI 
Mnll 
Tthlllll | 
Bcefl | | 
Sfcl | | | 
II II 



Muni 
Plel 
Tsp509I 
Mwol | 
I I 



BSU36I 
Hinfl Ddel 
I 



HaelV 
Hin4I 



AGAGACGGCAAACACCCTCTACAGAGCTCTGTATGCCAATTGGACTCCCTTAGGATATAA 

2101 + + + + + + 2160 

TCTCTGCCGTTTGTGGGAGATGTCTCGAGACATACGGTTAACCTGAGGGAATCCTATATT 



Dpnl 
Bglll 
BstYI 
Sau3AI | Hinfl 
BsaJI | | Pf 111081 | 

Styl j j Plel | j 

I I I I I I 

GGTCAATCCTGAATACCAAGGAGATCTTGCTACGACTCCCCTATGGCAATCCTTTCATAC 

2161 + + + + + + 2220 

CCAGTTAGGACTTATGGTTCCTCTAGAACGATGCTGAGGGGATACCGTTAGGAAAGTATG 



Hpyl78III 
CjePI | 
I I 
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Msel 
I 



Taql 
I 



EcoRV | 
Mnll | 
Hpyl88IX | j 
Hinf I | j j 

Tfil j i i 

Bsrl | III 
II 



CviJl 
Hael 
Haelll 
Hpyl78III 
Taql 
HphI | 



StuI 
I 



TATGTTCTCTCTATTAAGAAGTTATAATCGAACTGGTGATTCTGATATCGAGAGGCCTTT 

2221 + + + + + + 2280 

ATACAAGAGAGATAATTCTTCAATATTAGCTTGACCACTAAGACTATAGCTCTCCGGAAA 

NlalV 
Sthl32I 
Cvi JI | 
Sf aNI 
Neil 
ScrFI 
Smal 
BsaJI | 
Mspl | 
Neil | 

Sthl32I ScrFI | 

Apol Beef I | Aval| | 

Tsp509I CviJI Mnll | j BsaJI j j 

Ddel | Haelll Fokl | j Bpml j BsaJI | j j 

II I I I I II I I I I 

CTTAGAAATTCAAGGGATTGCCGACGGCCTCTTTGTTCATCAAAATAGCATCCCCGGGGC 

2281 + + + + + + 2340 

GAATCTTTAAGTTCCCTAACGGCTGCCGGAGAAACAAGTAGTTTTATCGTAGGGGCCCCG 



HaelV 
Hin4I 
Hinf I 
Tfil 
ScrFI | 
Banll | j 
Bspl286I j | 
EcoRII j | 
I I 



Mnll 
Tthlllll | 

BciVI Sf aNI | j 

Sfcl | Fokl Hpyl88IX | | j 

III I I II 

TCCAGGATTCCGTATCCAATCTACAGGGTATTCCTTACAAGCATCCTCCGAAACTTCTTT 

2341 + + + + + + 2400 

AGGTCCTAAGGCATAGGTTAGATGTCCCATAAGGAATGTTCGTAGGAGGCTTTGAAGAAA 
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Taal 
HphI | 

Bsu36I CviRI | j 

Hpyl88IX Ddel MboII | j j 

I I II I I 

ACATCAGAAAATCTCCTTAGGTTTTGCACAGTTCTTCACCCGCACTAAAGAAATCGGATC 

2401 + + + + + + 2460 

- TGTAGTCTTTTAGAGGAATCCAAAACGTGTCAAGAAGTGGGCGTGATTTCTTTAGCCTAG 



Faul Dpnl 
Sthl32l| Sau3AI | 
Acil | |Hpyl88IX| j 

I II III 



CviJT 
BsmAI | 
BsmBI j 
Tthlllll | 
Maell | | 

I I II 

II II 



CjePI 
Taal Earl | 

MboII | BsmAI | j 
II I I i 



BsaJI 
BstDSI NlalV 
Alul |DrdIlj 
CviJI j Mnllj 
II II 



AAGCAACAACGTCTCGGCTCACAATACAGTCTCTTCACTTTATGTTGAGCTTCCGTGGTT 

2461 + + + + + + 2520 

TTCGTTGTTGCAGAGCCGAGTGTTATGTCAGAGAAGTGAAATACAACTCGAAGGCACCAA 



CviRI 
CjePI 
CviJI 
Hael 
Haelll 
StuI 
Fokl | 
I I 



Fokl 
Bed | 
NlalV | j 
Avail | j j 
CviJI Sau96I j j j 
Taal RleAI | HphI | j j j BsmFI 

I II I II I I I 

CCAAGAGGCCTTTGCAACATCCCACAGTTTAGCGTATGGCTATGGGGACCATCACCTCCA 

2521 + + + + + + 2580 

GGTTCTCCGGAAACGTTGTAGGGTGTCAAATCGCATACCGATACCCCTGGTAGTGGAGGT 



Maelll 

Tsp45I Afllll BsmFI Fnu4HI 

Mnll |Hpyl78III Maell Cvi JI | Tsel | 

III I II II 

CGCCTACATCCGTCACATCAAGAACAGGGCAGAAGGGACGTGTTATAGCCATACATTAGC 

2581 + + + + + 1- 2640 

GCGGATGTAGGCAGTGTAGTTCTTGTCCCGTCTTCCCTGCACAATATCGGTATGTAATCG 



Alul 
CviJI 
Fnu4HI | 
Tsel| j 
II I 



Bbvl 
CviJI | 
Bbvl | j 
II 



Mnll 
BscGI | 
CviJI | j 
BbvCI | I | 
BpulOI j j | 
Ddel | j j 



BsaJI HphI 
Sty I MboII | 

I I I I I I I 

AGCAGCTATCGGCTGTTCTTTCCCTTGGCAACAGAAATCCTATCTTCACCTCAGCCCGTT 

2641 + + + + + + 2700 

TCGTCGATAGCCGACAAGAAAGGGAACCGTTGTCTTTAGGATAGAAGTGGAGTCGGGCAA 
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Muni 
Tsp509I 
BseMII | 
Sthl32I I 



Cjel 
Maell 
HphI | 
CviRI | j 
I I I 



Cjel 
MboII 
Maelll 
Tsp4 5I 
Tthlllll 
NspV | 
Taqi | 
Earl | | 
I I I 



CGTTCAGGCAATTGCAATACGTTCTCACCAAACAGCGTTCGAAGAGATTGGTGACAATCC 

2701 + + + + + + 2760 

GCAAGTCCGTTAACGTTATGCAAGAGTGGTTTGTCGCAAGCTTCTCTAACCACTGTTAGG 



BsaJI 
Styl 
Mnll | 
Hinf I 

Sthl32I Tfil 
HphI | BsmAI CviJI Hpyl88IX Hin4I Bfal | | j 

II II I I I I II 

CCGAAAGTTTGTCTCTCAAAAGCCTTTCTATAATCTGACCTTACCTCTAGGAATCCAAGG 

2761 + + + + + + 2820 

GGCTTTCAAACAGAGAGTTTTCGGAAAGATATTAGACTGGAATGGAGATCCTTAGGTTCC 



SfCl Hpyl78III 

Rsal | Bfal | 

Apol BsaAI | j Xbal | | 

Tsp509I Maell| j jpiel Hinf I ||| 

I I I I I I III 



Mspl 
BsaWI | 
BsrFI j 
PinAI | 
I I 



AAAATGGCAGTCAAAATTCCACGTACCTACAGAATGGACTCTAGAACTTTCTTACCAACC 



TTTTACCGTCAGTTTTAAGGTGCATGGATGTCTTACCTGAGATCTTGAAAGAATGGTTGG 



Rsal 
I 



Maelll 
Tsp45I 
BslI | 
I I 



ScrFI 
BsaJI | 
EcoRII | 
NlaIV| 
Ppil 
Acil | 



BsrBI 
Cac8I | 
Mnll | j 

I I ■ 



GGTACTCTATCAACAAAATCCCCAAATCGGTGTCACGCTACTTGCGAGCGGAGGTTCCTG 

2881 + + + + + + 2940 

CCATGAGATAGTTGTTTTAGGGGTTTAGCCACAGTGCGATGAACGCTCGCCTCCAAGGAC 
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CviJI 
Hael 
Haelll 
Bfal 
Avrll | 
BsaJI j 
Styl | 

EcoRV j | | MslI BsrDI Rsal MboII 

I II I I II I 

GGATATCCTAGGCCATAACTATGTTCGCAATGCTTTAGGGTACAAAGTCCACAATCAAAC 

2941 + + + 1- + + 3000 

CCTATAGGATCCGGTATTGATACAAGCGTTACGAAATCCCATGTTTCAGGTGTTAGTTTG 



BseRI 
BslI 
Dpnl 
Sau3AI 
BseRI | 
BsaJI | j | Bsal 
Styl j j j BsmAI 
Taqll | ill Alwl |TaqI 

I I II I III 
TGCGCTCTTCCGTTCTCTCGATCTATTCTTGGATTACCAAGGATCGGTCTCCTCCTCGAC 

3001 + + + + + + 3060 

ACGCGAGAAGGCAAGAGAGCTAGATAAGAACCTAATGGTTCCTAGCCAGAGGAGGAGCTG 



Hhal 
I 



Dpnl 
Sau3AI 
Hpyl78III | 
Earl | j 

Sapl TaqI j 
I II 



Apol 
Tsp509I 
Tthlllll | 

Mnll Msel| | 

Mnll | Bed Rsal | j | Tsp509I 

III I II I I 

ATCTACGCACCATCTCCAAGCAGGAAGTACCTTAAAATTCTAAAATAAAAGAACGATAAA 

3061 + + + + + + 3120 

TAGATGCGTGGTAGAGGTTCGTCCTTCATGGAATTTTAAGATTTTATTTTCTTGCTATTT 



CviJI 
Mwol 
Maell | 



Msel Alul | | 

Tsp509I | Hpyl88IX CviJI j j 
II I III 

ATTGAAATCTTTAGAATTAACAACTATCCGATGAGCTACGTTAGCCCAATCGGTAGAGGA 

3121 + + + + + + 3180 

TAACTTTAGAAATCTTAATTGTTGATAGGCTACTCGATGCAATCGGGTTAGCCATCTCCT 



Plel 
Mnll j Hinfl 
I I I 
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Dral 
Mnll 
Swal 
Msel | 
Apol | | 
Tsp509I j j 

I M 
CTCCCTCAAAATTTAAATAA 

3181 + + 3200 

GAGGGAGTTTTAAATTTATT 
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Figure 23: 

tagacactat aaaacaaatt atagacaaaa aatctagcat tgatttattc agaatatttc 60 

tttctatttg tgaacgagta tgcgcttttt ttgcttcgga atg ttg ctt cct ttt 115 

Met Leu Leu Pro Phe 



act ttt gta ttg get aat gaa ggt etc caa ctt cct ttg gag acc tat 163 

Thr Phe Val Leu Ala Asn Glu Gly Leu Gin Leu Pro Leu Glu Thr Tyr 
10 15 20 

att aca tta agt cct gaa tat caa gca gec cct caa gta ggg ttt act 211 

lie Thr Leu Ser Pro Glu Tyr Gin Ala Ala Pro Gin Val Gly Phe Thr 
25 30 35 

cat aac caa aat caa gat etc gca att gtc ggg aat cac aat gat ttc 259 

His Asn Gin Asn Gin Asp Leu Ala lie Val Gly Asn His Asn Asp Phe 
40 45 50 

ate ttg gac tat aag tac tat egg teg aat gga ggt get ctt acc tgt 307 

lie Leu Asp Tyr Lys Tyr Tyr Arg Ser Asn Gly Gly Ala Leu Thr Cys 
55 60 65 

aag aat ctt ctg ate tct gaa aat ata ggg aat gtc ttc ttt gag aag 355 

Lys Asn Leu Leu lie Ser Glu Asn lie Gly Asn Val Phe Phe Glu Lys 
70 75 80 85 

aat gtc tgt ccc aat tct ggc ggg gca att tat get get caa aat tgc 403 

Asn Val Cys Pro Asn Ser Gly Gly Ala He Tyr Ala Ala Gin Asn Cys 
90 95 100 

acg ate tec aag aat cag aac tat gca ttt act aca aac ttg gtc tct 451 

Thr He Ser Lys Asn Gin Asn Tyr Ala Phe Thr Thr Asn Leu Val Ser 
105 110 115 

gac aat cct aca gee act gcg gga tea eta ttg ggt gga get etc ttt 4 99 

Asp Asn Pro Thr Ala Thr Ala Gly Ser Leu Leu Gly Gly Ala Leu Phe 
120 125 130 

gec ata aat tgc tct att act aat aac eta gga cag gga act ttc gtt 547 

Ala He Asn Cys Ser He Thr Asn Asn Leu Gly Gin Gly Thr Phe Val 
135 140 145 

gac aat etc get tta aat aag ggg ggt gec etc tat act gag acg aac 5 95 

Asp Asn Leu Ala Leu Asn Lys Gly Gly Ala Leu Tyr Thr Glu Thr Asn 

150 155 160 165 

tta tct att aaa gac aat aaa ggc ccg ate ata ate aag cag aat egg 643 

Leu Ser He Lys Asp Asn Lys Gly Pro He He He Lys Gin Asn Arg 
170 175 180 

gca eta aat teg gac agt tta gga gga ggg att tat agt ggg aac tct 691 

Ala Leu Asn Ser Asp Ser Leu Gly Gly Gly He Tyr Ser Gly Asn Ser 
185 190 195 
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eta aat ata gag gga aat tct gga get ata cag ate aca age aac tct 739 

Leu Asn lie Glu Gly Asn Ser Gly Ala He Gin He Thr Ser Asn Ser 
200 205 210 

tea gga tct ggg gga ggc ata ttt tct ace caa aca etc acg ate tec 787 
Ser Gly Ser Gly Gly Gly He Phe Ser Thr Gin Thr Leu Thr He Ser 
215 220 225 

teg aat aaa aaa etc ata gaa ate agt gaa aat tec gcg ttc gca aat 835 
Ser Asn Lys Lys Leu He Glu He Ser Glu Asn Ser Ala Phe Ala Asn 
230 235 240 245 

aac tat gga teg aac ttc aat cca gga gga gga ggt ctt act acc ace 883 
Asn Tyr Gly Ser Asn Phe Asn Pro Gly Gly Gly Gly Leu Thr Thr Thr 
250 255 260 

ttt tgc acg ata ttg aac aac cga gaa ggg gta etc ttt aac aat aac 931 
Phe Cys Thr He Leu Asn Asn Arg Glu Gly Val Leu Phe Asn Asn Asn 
2S5 270 275 

caa age cag age aac ggt gga gee att cat gcg aaa tct ate att ate 979 
Gin Ser Gin Ser Asn Gly Gly Ala lie His Ala Lys Ser He He He 
280 285 290 

aaa gaa aat ggt cct gta tac ttt tta aat aac act gca act egg gga 1027 
Lys Glu Asn Gly Pro Val Tyr Phe Leu Asn Asn Thr Ala Thr Arg Gly 
295 300 305 

ggg get etc etc aac tta tea gca ggt tct gga aac gga age ttc ate 1075 
Gly Ala Leu Leu Asn Leu Ser Ala Gly Ser Gly Asn Gly Ser Phe He 
310 315 320 325 

tta tct gca gat aat gga gat att ate ttt aac aat aat acg gec tec 1123 
Leu Ser Ala Asp Asn Gly Asp He lie Phe Asn Asn Asn Thr Ala Ser 
330 335 340 

aag cat gee etc aat cct cca tac aga aac gec att cac teg act cct 1171 
Lys His Ala Leu Asn Pro Pro Tyr Arg Asn Ala He His Ser Thr Pro 
345 350 355 

aat atg aat ctg caa ata gga gee cgt ccc ggc tat cga gtg ctg ttc 1219 
Asn Met Asn Leu Gin He Gly Ala Arg Pro Gly Tyr Arg Val Leu Phe 
360 365 370 

tat gat ccc ata gaa cat gag etc cct tec tec ttc ccc ata etc ttt 1267 
Tyr Asp Pro He Glu His Glu Leu Pro Ser Ser Phe Pro lie Leu Phe 
375 380 385 

aat ttc gaa acc ggt cat aca ggt aca gtt tta ttt tea ggg gaa cat 1315 
Asn Phe Glu Thr Gly His Thr Gly Thr Val Leu Phe Ser Gly Glu His 
390 395 400 405 
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gta cac cag aac ttt acc gat gaa atg aat ttc ttt tec tat tta agg 

Val His Gin Asn Phe Thr Asp Glu Met Asn Phe Phe Ser Tyr Leu Arg 
410 415 420 

aac act teg gaa eta cgt caa gga gtc ctt get gtt gaa gat ggt gcg 
Asn Thr Ser Glu Leu Arg Gin Gly Val Leu Ala Val Glu Asp Gly Ala 
425 430 435 

ggg ctg gee tgc tat aag ttc ttc caa cga gga ggc act eta ctt eta 
Gly Leu Ala Cys Tyr Lys Phe Phe Gin Arg Gly Gly Thr Leu Leu Leu 
440 445 450 

ggt caa ggt gcg gtg ate acg aca gca gga acg att ccc aca cca tec 
Gly Gin Gly Ala Val lie Thr Thr Ala Gly Thr lie Pro Thr Pro Ser 
455 460 465 

tea aca cca acg aca gta gga agt act ata act tta aat cac att gee 
Ser Thr Pro Thr Thr Val Gly Ser Thr lie Thr Leu Asn His lie Ala 
470 475 480 485 

att gac ctt cct tct att ctt tct ttt caa get cag get cca aaa att 
lie Asp Leu Pro Ser lie Leu Ser Phe Gin Ala Gin Ala Pro Lys lie 
490 495 500 

tgg att tac ccc aca aaa aca gga tct acc tat act gaa gat tec aac 
Trp lie Tyr Pro Thr Lys Thr Gly Ser Thr Tyr Thr Glu Asp Ser Asn 
505 510 515 

ccg aca ate aca ate tea gga act etc acc tta cgc aac age aac aac 
Pro Thr lie Thr lie Ser Gly Thr Leu Thr Leu Arg Asn Ser Asn Asn 
520 525 530 

gaa gat ccc tac gat agt ctg gat etc teg cac tct ctt gag aaa gtt 
Glu Asp Pro Tyr Asp Ser Leu Asp Leu Ser His Ser Leu Glu Lys Val 
535 540 545 

ccc ctt ctt tat att gtc gat gtc get gca caa aaa att aac tct teg 
Pro Leu Leu Tyr lie Val Asp Val Ala Ala Gin Lys lie Asn Ser Ser 
550 555 560 565 

caa ctg gat eta tec aca tta aat tct ggc gaa cac tat ggg tat caa 
Gin Leu Asp Leu Ser Thr Leu Asn Ser Gly Glu His Tyr Gly Tyr Gin 
570 575 580 

ggc ate tgg teg acc tat tgg gta gaa act aca aca ate acg aac cct 
Gly lie Trp Ser Thr Tyr Trp Val Glu Thr Thr Thr lie Thr Asn Pro 
585 590 595 



aca tct eta eta ggc gcg aat aca aaa cac aag ctg etc tat gca aac 
Thr Ser Leu Leu Gly Ala Asn Thr Lys His Lys Leu Leu Tyr Ala Asn 
600 605 610 
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tgg tct cct eta ggc tac cgt cct cat ccc gaa cgt cga gga gaa ttc 1987 
Trp Ser Pro Leu Gly Tyr Arg Pro His Pro Glu Arg Arg Gly Glu Phe 
615 620 625 



att acg aat gec ttg tgg caa teg gca tat acg get ctt gca gga etc 

lie Thr Asn Ala Leu Trp Gin Ser Ala Tyr Thr Ala Leu Ala Gly Leu 

630 635 640 645 

cac tec etc tec tec tgg gat gaa gag aag ggt cat gca get tec eta 

His Ser Leu Ser Ser Trp Asp Glu Glu Lys Gly His Ala Ala Ser Leu 
650 655 660 



caa ggc att ggt ctt ctg gtt cat caa aaa gac aaa aac ggt ttt aag 

Gin Gly lie Gly Leu Leu Val His Gin Lys Asp Lys Asn Gly Phe Lys 

665 670 675 

gga ttt cgt agt cat atg aca ggt tat agt get ace ace gaa gca acc 

Gly Phe Arg Ser His Met Thr Gly Tyr Ser Ala Thr Thr Glu Ala Thr 

680 685 690 

tct tct caa agt ccg aat ttc tct tta gga ttt get cag ttc ttc tec 

Ser Ser Gin Ser Pro Asn Phe Ser Leu Gly Phe Ala Gin Phe Phe Ser 

695 700 705 



aaa get aaa gaa cat gaa tct caa aat age acg tec tct cac cac tat 2275 
Lys Ala Lys Glu His Glu Ser Gin Asn Ser Thr Ser Ser His His Tyr 
710 715 720 725 



ttc tct gga atg tgc ata gca aaa tac tct ctt caa aga gtg ata cgt 2323 
Phe Ser Gly Met Cys lie Ala Lys Tyr Ser Leu Gin Arg Val lie Arg 
730 735 740 



eta tct gtg tct ctt get tat atg ttt acc teg gaa cat acc cat aca 2 371 
Leu Ser Val Ser Leu Ala Tyr Met Phe Thr Ser Glu His Thr His Thr 
745 750 755 



atg tat cag ggt etc ctg gaa ggg aac tct cag gga tct ttc cac aac 
Met Tyr Gin Gly Leu Leu Glu Gly Asn Ser Gin Gly Ser Phe His Asn 
760 765 770 



cat acc tta gca ggg get etc tec tgt gtt ttc tta cct caa cct cac 
His Thr Leu Ala Gly Ala Leu Ser Cys Val Phe Leu Pro Gin Pro His 
775 780 785 



ggc gag tec ctg cag ate tat ccc ttt att act gee tta gee ate cga 2515 
Gly Glu Ser Leu Gin lie Tyr Pro Phe lie Thr Ala Leu Ala lie Arg 
790 795 800 805 



gga aat ctt get gcg ttt caa gaa tct 
Gly Asn Leu Ala Ala Phe Gin Glu Ser 
810 



gga gac cat get egg gaa ttt 2563 
Gly Asp His Ala Arg Glu Phe 
815 820 
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tec eta cac cgc ccc eta acg gac gtc tec etc cct gta 
Ser Leu His Arg Pro Leu Thr Asp Val Ser Leu Pro Val 
825 830 

get tct tgg aag aac cac cac cga gtt ccc eta gtc tgg 
Ala Ser Trp Lys Asn His His Arg Val Pro Leu Val Trp 
840 845 850 



att tec tat cgc tct act etc 
lie Ser Tyr Arg Ser Thr Leu 
855 860 



tat agg caa gat cct gaa 
Tyr Arg Gin Asp Pro Glu 
865 
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gga ate cgc 2611 

Gly lie Arg 

835 

etc aca gaa 2659 
Leu Thr Glu 



etc cac teg 2707 
Leu His Ser 



aaa tta ctg att age caa ggt acg tgg 
Lys Leu Leu lie Ser Gin Gly Thr Trp 
870 875 

ace tac aat get tta ggg ate aaa gtg 
Thr Tyr Asn Ala Leu Gly lie Lys Val 
890 

cct aaa gtc act etc tec tta gat tac 
Pro Lys Val Thr Leu Ser Leu Asp Tyr 
905 910 



acg acg cag gee act cct gtg 2755 
Thr Thr Gin Ala Thr Pro Val 
880 885 

aaa aat ace atg cag gtg ttt 2803 
Lys Asn Thr Met Gin Val Phe 
895 900 

tct gcg gat att tct tec tec 2851 
Ser Ala Asp lie Ser Ser Ser 
915 



acg ctg agt cac tac tta aac gtg gcg agt aga atg aga ttt 
Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met Arg Phe 
920 925 930 



taacaataag tgaccaaaac agaaagatta aggaacctct agtgtcaaag actcctccta 2953 
agtttttatt etatcteggg aatttcacag cctgcatgtt egggatg 3000 
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Figure 24 (RY-46) 

Restriction enzyme analysis of CPN1 00628 



HaelV 

Tsp509I Hin4I Bfal Hpyl88IX Sspl 

I II I I 

TAGACACTATAAAACAAATTATAGACAAAAAATCTAGCATTGATTTATTCAGAATATTTC 

! + + + + + + 60 

ATCTGTGATATTTTGTTTAATATCTGTTTTTTAGATCGTAACTAAATAAGTCTTATAAAG 

Hpyl8 8IX 
Hhal Mwol | 
i I I 

TTTCTATTTGTGAACGAGTATGCGCTTTTTTTGCTTCGGAATGTTGCTTCCTTTTACTTT 

6i + + + + + + 120 

AAAGATAAACACTTGCTCATACGCGAAAAAAACGAAGCCTTACAACGAAGGAAAATGAAA 

Bce83I 
Hpyl78III | 
Msel | j 

Bsal Bsal Cjel| j j 

CviJI Cjel BsmAI BsmAI Mmel j j | j 

II II I I I I I 

TGTATTGGCTAATGAAGGTCTCCAACTTCCTTTGGAGACCTATATTACATTAAGTCCTGA 

121 + + + + + + 180 

ACATAACCGATTACTTCCAGAGGTTGAAGGAAACCTCTGGATATAATGTAATTCAGGACT 



Mnll 

Tthlllll | 

Bbvl | j 

CviJI Bsllj j 

Fnu4HI I ECONI | j | 

Tsel | j Smll III j 
III ill 



Muni 
Tsp509I 
Sthl32I 
Dpnl | 
Bglll | j 



BstYI 
Sau3AI 
Hpyl78III | 

I I 



ATATCAAGCAGCCCCTCAAGTAGGGTTTACTCATAACCAAAATCAAGATCTCGCAATTGT 



TATAGTTCGTCGGGGAGTTCATCCCAAATGAGTATTGGTTTTAGTTCTAGAGCGTTAACA 



Hinf I 
Tf il 
Hpyl78III | 
I 



Rsal 
Seal 
Tat I | 
Taqll | | 
I I I 



BsiEI 
Mnll 
Taql 
I 



BsiHKAI 
Bspl28€I 
I 



CGGGAATCACAATGATTTCATCTTGGACTATAAGTACTATCGGTCGAATGGAGGTGCTCT 

241 + + + + + + 300 

G CCCTTAGTGTTACTAAAGTAGAAC CTGATATT CATGATAG C CAGCTTAC CTC C ACG AGA 
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24 (con't) 



Hpyl88IX 
Dpnl 
Sau3AI 
Hpyl8 8IX| 
Hinf I | j 

MboII Tfil j j 

II II 



Bbsl 
MboII 



BsmFI 
I 



TACCTGTAAGAATCTTCTGATCTCTGAAAATATAGGGAA.TGTCTTCTTTGAGAAGAATGT 
ATGGACATTCTTAGAAGACTAGAGACTTTTATATCCCTTACAGAAGAAACTCTTCTTACA 



BslI 
Faul 
Sthl32I | 
Tsp509I j 
MboII | j 
I I 



Fnu4HI 
Tsel 
BstAPI 
Tsp509I | 
Bbvl | | 

Acil | j Mwol 

I I 



Dpnl 
Sau3AI j 
CviRI | | 



Hpyl8 8IX 
Hinf I | 

|Tsp509I | | | Tfil j 

III II 
CTGTCCCAATTCTGG CGGGG C AATTTATG CTG C TCAAAATTG CACGATCTCCAAG AATC A 

361 + + + + + + 420 

GACAGGGTTAAGACCGCCCCGTTAAATACGACGAGTTTTAACGTGCTAGAGGTTCTTAGT 

Dpnl 
Sau3AI 
Taqll 
TspRI | 
Acil 
Btsl 
AlwNI j 
CjePI 
CviJI | 

Bsal Faul | j 

Nsil BsmAI Sthl32l||| 

CviRI | Hpyl88IX| Sfcl Jill 

II II I I I I I 

GAACTATGCATTTACTACAAACTTGGTCTCTGACAATCCTACAGCCACTGCGGGATCACT 

421 + + + + + + 480 

CTTGATACGTAAATGATGTTTGAACCAGAGACTGTTAGGATGTCGGTGACGCCCTAGTGA 

Banll BslI 

BsiHKAI EcoNI | 

Bspl286I Bfal | j 

Sad Avrll | j j 

Alul | CjePI BsaJI | | | 

Alwl CviJI j Mwol | Tsp509I Styl j j j 

I I I II I II I I 

ATTGGGTGGAGCTCTCTTTGCCATAAATTGCTCTATTACTAATAACCTAGGACAGGGAAC 

481 + + + \- +■ + 540 

TAACCCACCTCGAGAGAAACGGTATTTAACGAGATAATGATTATTGGATCCTGTCCCTTG 
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Fig. 24 (con't) 



BsmAI 
BsmBI 
Bspl286I 
BseMII | 
Bmgl | I 
BseSI j j 

Dral NlaIV| j j | Mnll 

Hindi Msel| BanI | j j j |DdeI | 

I II I I I I I I I I 

TTTCGTTGACAATCTCGCTTTAAATAAGGGGGGTGCCCTCTATACTGAGACGAACTTATC 

541 + + + + + + 600 

AAAGCAACTGTTAGAGCGAAATTTATTC C CC C C ACGGG AGATATGACTCTG CTTG AATAG 



Hinf I 
Bcgl 
Sthl32I 
BsaBI 



Sthl32I 
Dpnl 
Sau3AI 
CviJI | 
Haelll j 
Sau96l| | 
I I I 



Tf il 
I 



Hpyl8 8IX 
Apol 



Tsp509I 
Tthlllll 
Bspl286I | 
Bmgl | j 
BseSI j j 
II 



Mnll 
Taal 
I 



TATTAAAGACAATAAAGGCCCGATCATAATCAAGCAGAATCGGGCACTAAATTCGGACAG 

601 + + + + + + 660 

ATAATTTCTGTTATTTCCGGGCTAGTATTAGTTCGTCTTAGCCCGTGATTTAAGCCTGTC 



Alul 
CviJI 

Bcgl Hpyl78III | 

CjePI | CjePI Apol | j 

Mnll | j BseRI Mnll | Tsp509I j j 

III I II III 

TTTAGGAGGAGGGATTTATAGTGGGAACTCTCTAAATATAGAGGGAAATTCTGGAGCTAT 

661 + + + + + + 720 

AAATCCTCCTCCCTAAATATCACCCTTGAGAGATTTATATCTCCCTTTAAGACCTCGATA 



Bpml 
MboII 
Dpnl | 
Eco57I | | 
Sau3AI | j 



AlwNI 
Mnll 
Dpnl 
Tthlllll 

BstYI 
Sau3AI 
Earl | 
Hpyl7 8III j 
I I 



Sau3AI 
Hpyl78III | 
BseRI | j 

I II 



ACAGATCACAAGCAACTCTTCAGGATCTGGGGGAGGCATATTTTCTACCCAAACACTCAC 



TGTCTAGTGTTCGTTGAGAAGTCCTAGACCCCCTCCGTATAAAAGATGGGTTTGTGAGTG 
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Fig. 24 (con't) 



TaqI 
Tthlllll | 
Dpnl | I 

I I I 



Apol 
Tsp50 9I Thai 
TspRI Acil | 
I t I 



GATCTCCTCGAATAAAAAACTCATAGAAATCAGTGAAAATTCCGCGTTCGCAAATAACTA 
CTAGAGGAGCTTATTTTTTGAGTATCTTTAGTCACTTTTAAGGCGCAAGCGTTTATTGAT 



BslI 
Mnll 
ScrFI 
Mnll 



TaqI EcoRII | 

Dpnl | Mnll | I 

Sau3AI | | Alwl ] j | 
II! I Ml 



BseRI 
BseRI | 



CviRI 
I 



841 



TGGATCGAACTTCAATCCAGGAGGAGGAGGTCTTACTACCACCTTTTGCACGATATTGAA 
h + + + + + : 

ACCTAGCTTGAAGTTAGGTCCTCCTCCTCCAGAATGATGGTGGAAAACGTGCTATAACTT 

CviJI Taal CviJI 

BslI Rsal Msel CjePI | Xcml |NlaIV| Mwol 

III II I I II I 

CAACCGAGAAGGGGTACTCTTTAACAATAACCAAAGCCAGAGCAACGGTGGAGCCATTCA 

901 + + + + + + 

GTTGGCTCTTCCCCATGAGAAATTGTTATTGGTTTCGGTCTCGTTGCCACCTCGGTAAGT 

Aval 
Mnll 
TspRI 
CviRI | 

CjePI Avail BstZ17I Dral BtsI j j 

Nlalll I Sau96I Accl] Msel | Sthl32l| j j 

M I II II II I I 

TGCGAAATCTATCATTATCAAAGAAAATGGTCCTGTATACTTTTTAAATAACACTGCAAC 

961 + + + + + + 

ACGCTTTAGATAGTAATAGTTTCTTTTACCAGGACATATGAAAAATTTATTGTGACGTTG 



Banll 
Bspl286I 
CviJI | 
Hin4l| j 
BseRI | | j 

I I I 



Hpyl78III 
BplI | 
Mnll | AlwNI | 
II II 



Alul 
CviJI 
Hindi I I | 



Sf cl 

I 



TCGGGGAGGGGCTCTCCTCAACTTATCAGCAGGTTCTGGAAACGGAAGCTTCATCTTATC 

+ + + + + + 

AGCCCCTCCCCGAGAGGAGTTGAATAGTCGTCCAAGACCTTTGCCTTCGAAGTAGAATAG 
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PstI 
CviRI | 
i I 
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I 



CviJI 
Haelll 
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Beef I 
Nlalll | 
Nspl | 
SphI j 
Mnll | j 
CacQI | j |TthlllII 
I II I 



TGCAGATAATGGAGATATTATCTTTAACAATAATACGGCCTCCAAGCATGCCCTCAATCC 



ACGTCTATTACCTCTATAATAGAAATTGTTATTATGCCGGAGGTTCGTACGGGAGTTAGG 



Mnll Mnll 
I I 



Hinf I 
Taqi | 
Plel | | 
I I 



CviRI 
BsmFI | 
Hinf I | j 

Tf il | | 

II 



Banll 

BSCGI 
Bspl286I Mspl 
CviJI | Neil 
NlaIV| jscrFI 
II I I 



TCCATACAGAAACGCCATTCACTCGACTCCTAATATGAATCTGCAAATAGGAGCCCGTCC 



AGGTATGTCTTTGCGGTAAGTGAGCTG AGGATTATACTTAGACGTTTATC CTCGGG CAGG 



Taqi 
Sthl32l| 
CviJI | | 
Sthl32I | j | 
I I 1 



Dpnl 
Sau3AI | 
Alwl | j 

I I 



Nl 



Banll 
BsiHKAI 
Bspl286I 
SacI 
Alul 
CviJI 
BsaXI 
III 
Alol | 
Ppil| 
' I 



Mnll 
I 



CGGCTATCGAGTGCTGTTCTATGATCCCATAGAACATGAGCTCCCTTCCTCCTTCCCCAT 

1201 + + + + + + 1260 

GCCGATAGCTCACGACAAGATACTAGGGTATCTTGTACTCGAGGGAAGGAGGAAGGGGTA 



Rsal 

Mspl Nlalllj 

BsaWl| Nspl j 

BsrFI j BsrGI | j 

PinAI j TatI j j 

Tsp509I NspV | ] Taal Af 1III | j j 

Msel| Taqi || Rsal | BspLUllI ||j 

II I II II I Ml 
ACTCTTTAATTTCGAAACCGGTCATACAGGTACAGTTTTATTTTCAGGGGAACATGTACA 
1261 + + + + + + 1320 



TGAGAAATTAAAGCTTTGGCCAGTATGTCCATGTCAAAATAAAAGTCCCCTTGTACATGT 
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Hpyl88IX Maell 
I I 



Fig. 24 (con't) 

BslI 

Apol Msel | 

Tsp509I ECONI | j 

I III 
CCAGAACTTTACCGATGAAATGAATTTCTTTTCCTATTTAAGGAACACTTCGGAACTACG 

1321 + + + + + + 1380 

GGTCTTGAAATGGCTACTTTACTTAAAGAAAAGGATAAATTCCTTGTGAAGCCTTGATGC 

MboII 
Cac8I 
CviJI 
Hael 
Haelll 
Cac8I 
Mwol 



Hinf I 
I 



Bed 

Faul [ CviJT | 

Sthl32I | [ MboII | | 
Plel | | |AciI | | | 

I II I III! 



Mnll 
Mnll | 
I I 



TCAAGGAGTCCTTGCTGTTGAAGATGGTGCGGGGCTGGCCTGCTATAAGTTCTTCCAACG 

1381 + + + + + + 1440 

AGTTCCTCAGGAACGACAACTTCTACCACGCCCCGACCGGACGATATTCAAGAAGGTTGC 



Mmel 
Bf al | 
BseRI | j 

I I I 



Hpyl78III 

Dpnl 
Bell | 
Sau3AI | 
Acil | j 
I I I 



HphI 
I 



Hinf I 
Tf il 
Fokl | 
I I 



AGGAGGCACTCTACTTCTAGGTCAAGGTGCGGTGATCACGACAGCAGGAACGATTCCCAC 

1441 + + + + + + 1500 

TCCTCCGTGAGATGAAGATCCAGTTCCACGCCACTAGTGCTGTCGTCCTTGCTAAGGGTG 



BsrDI 
CjePI 

Mnll Rsal Dral | 

Cjel RleAI j Seal Msel | j 

BccI | Bsbl | j Taal TatI | Cjel j j j 

II III I II I II I 

ACCATCCTCAACACCAACGACAGTAGGAAGTACTATAACTTTAAATCACATTGCCATTGA 

1501 + + + + + + 1560 

TGGTAGGAGTTGTGGTTGCTGTCATCCTTCATGATATTGAAATTTAGTGTAACGGTAACT 



BstXI 
BseMII | 
BpulOI Apol 
Ddel Tsp509I 
Alul | NlalV ] 
CjePI CviJI j CviJI | j 
1 I I II I 

CCTTCCTTCTATTCTTTCTTTTCAAGCTCAGGCTCCAAAAATTTGGATTTACCCCACAAA 

1561 + + + + + + 1620 

GGAAGGAAGATAAGAAAGAAAAGTTCGAGTCCGAGGTTTTTAAACCTAAATGGGGTGTTT 
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Alwl 

RleAI | Mmel 
Dpnl | I Eco57I HphI | 

BstYI | I j Hinfl Sthl32l| Hpyl78III | 

Sau3AI j j | Tfil MboII | j Ddel | j BseMII 

I I I I I II! Ill I 

AACAGGATCTACCTATACTGAAGATTCCAACCCGACAATCACAATCTCAGGAACTCTCAC 

1621 + + + + + + 1680 

TTGTCCTAGATGGATATGACTTCTAAGGTTGGGCTGTTAGTGTTAGAGTCCTTGAGAGTG 



Dpnl 
BstYI 
Sau3AI 
Hpyl78III j 
MboII 
Pflll08I | 



Dpnl 
BstYI | 
Sau3AI j 
Alwl | j 

I I I 



Hpyl78III 
Alwl Smll| 
I I I 



CTTACGCAACAGCAACAACGAAGATCCCTACGATAGTCTGGATCTCTCGCACTCTCTTGA 

1681 + + + + + + 1740 

GAATGCGTTGTCGTTGTTGCTTCTAGGGATGCTATCAGACCTAGAGAGCGTGAGAGAACT 



Bbvl 
Bagl| 

Bce83I | | 
Bcgl | j j TaqI 
lilt I 



Bcgl 

CviRI Msel | 

Fnu4HI | MboII | | 
Tsel| |Tsp509I j j 
III III 



GAAAGTTCCCCTTCTTTATATTGTCGATGTCGCTGCACAAAAAATTAACTCTTCGCAACT 

1741 + + + + + + 1800 

CTTTCAAGGGGAAGAAATATAACAGCTACAGCGACGTGTTTTTTAATTGAGAAGCGTTGA 



BsrI 
Dpnl 
BstYI | 
Sau3AI j 
I I 



Apol 
Tsp509I 
Alwl Msel | 
I I 



Hindi 
Sf aNI 
Accl | 
Taql| 
Sail | | 
III 



GGATCTATCCACATTAAATTCTGGCGAACACTATGGGTATCAAGGCATCTGGTCGACCTA 

1801 + + + + + + 1860 

CCTAGATAGGTGTAATTTAAGACCGCTTGTGATACCCATAGTTCCGTAGACCAGCTGGAT 



Bbvl 
Hhal | 

Hpyl78III Bfal Thai | 

I III 
TTGGGTAGAAACTACAACAATCACGAACCCTACATCTCTACTAGGCGCGAATACAAAACA 

1861 + + + + + + 1920 

AACCCATCTTTGATGTTGTTAGTGCTTGGGATGTAGAGATGATCCGCGCTTATGTTTTGT 
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Fnu4HI 
Alul | 
Cvi JI j 
Tsel j 
II 



Taal 
Mnll 
CviJI 
Fokl 
Bf al 
Bsal | 



BseRI 
CviRI | 
I I 



BsmAI 
BsrI | 
I 



TaqI 
Sthl32I 
Maell 
Mnll | 
Mnll | j 
Hpyl78III | I j 
I III 



CAAGCTGCTCTATGCAAACTGGTCTCCTCTAGGCTACCGTCCTCATCCCGAACGTCGAGG 

1921 + + + + + + 

GTTCGACGAGATACGTTTGACCAGAGGAGATCCGATGGCAGGAGTAGGGCTTGCAGCTCC 

BseRI 
Beef I | 
Hinf I 

Apol CjePI | 

EcoRI CviJI CviRI | 

Tsp509I BseRI BsmI Mwol | Plel | j 

I II I I I I I 

AGAATTCATTACGAATGCCTTGTGGCAATCGGCATATACGGCTCTTGCAGGACTCCACTC 

1981 + + + + + + 2040 

TCTTAAGTAATGCTTACGGAACACCGTTAGCCGTATATGCCGAGAACGTCCTGAGGTGAG 



Alul 
CviJI 
Fnu4HI 
CviRI | 
Nlalll | 
Tsel j 

MboII | j | Bbsl 
Fokl | jjj MboII 
Siml j III Bbvl | 
I I II I II 

CCTCTCCTCCTGGGATGAAGAGAAGGGTCATGCAGCTTCCCTACAAGGCATTGGTCTTCT 

2041 + + + + + + 2100 

GGAGAGGAGGACCCTACTTCTCTTCCCAGTACGTCGAAGGGATGTTCCGTAACCAGAAGA 



Earl 
Mnll | 
ScrFl| | 
BsaJI | | J CjePI 
BsaXI EcoRIl| j j | Mnll | 

Mill I I 



Msel 

Drdll Taal | Pflll08I Ndel 

I II II 

GGTTCATCAAAAAGACAAAAACGGTTTTAAGGGATTTCGTAGTCATATGACAGGTTATAG 

2101 + + + + + + 2160 

CCAAGTAGTTTTTCTGTTTTTGCCAAAATTCCCTAAAGCATCAGTATACTGTCCAATATC 

Apol 
Tsp509I 
Hpyl8 8IX | 

Mnll | j Ddel 
MboII Earl | | j MboII | 

I I I I I M 

TGCTACCACCGAAGCAACCTCTTCTCAAAGTCCGAATTTCTCTTTAGGATTTGCTCAGTT 

2161 + + + + + + 2220 

ACGATGGTGGCTTCGTTGGAGAAGAGTTTCAGGCTTAAAGAGAAATCCTAAACGAGTCAA 
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Alul 
CviJI 
BseMII | 

I I 



Hinf I 
Nlalll 
Tf il 

I 



CjePI 
HphI | 
Maell I 
I I 



Mnll 

I 



CTTCTCCAAAGCTAAAGAACATGAATCTCAAAATAGCACGTCCTCTCACCACTATTTCTC 

2221 + + + + + + 2280 

GAAGAGGTTTCGATTTCTTGTACTTAGAGTTTTATCGTGCAGGAGAGTGGTGATAAAGAG 



Mboll 
CjePI | 

| j Earl Maell BsmAI 

II II I 

TGGAATGTGCATAGCAAAATACTCTCTTCAAAGAGTGATACGTCTATCTGTGTCTCTTGC 



Hpyl7 8III CviRI 



ACCTTACACGTATCGTTTTATGAGAGAAGTTTCTCACTATGCAGATAGACACAGAGAACG 



Cjel 
Bsal 
BsmAI 
ScrFI 
EcoRII | 
Siml | | | Ddel 
I I I 

TTATATGTTTACCTCGGAACATACCCATACAATGTATCAGGGTCTCCTGGAAGGGAACTC 

2341 + + + + + + 2400 

AATATACAAATGGAGCCTTGTATGGGTATGTTACATAGTCCCAGAGGACCTTCCCTTGAG 



Cjel 
Hpyl8 8IX | 
BsaJI | j Mnll 

i I 



Dpnl 
BstYI I BseMII 
Sau3AI | Alwl | 

I I 



Banll 
Bspl286I 
BpulOI CviJI | 

Ddel BslI | j 

I I II 



TCAGGGATCTTTCCACAACCATACCTTAGCAGGGGCTCTCTCCTGTGTTTTCTTACCTCA 

2401 + + + + + + 2460 

AGTCCCTAGAAAGGTGTTGGTATGGAATCGTCCCCGAGAGAGGACACAAAAGAATGGAGT 



Mnll 
Hinf I | 
Mnll | | 

I I 



Dpnl 
Beef I 
Bglll | 
BstYI | 
PstI j 
Sau3AI j 
CviRI | j 
Plel | j 
Sfcl | | | 
I 



Fokl 
I 



Bbvl 
BsaJI 
Hpyl88IX 
Bed 
CviJI 
Mnll 
BpulOI | 
Ddel | 
I I 



ACCTCACGGCGAGTCCCTGCAGATCTATCCCTTTATTACTGCCTTAGCCATCCGAGGAAA 

2461 + + + + + + 2520 

TGGAGTGCCGCTCAGGGACGTCTAGATAGGGAAATAATGACGGAATCGGTAGGCTCCTTT 



SUBSTITUTE SHEET (RULE 26) 
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. 24 (con't) 



Sthl32I 
Hpyl78III | Apol 
Bsal | | Tsp509I 
BsmAI | j Hpyl78III | 
Hinfl| j | BslI | j 
Fnu4HI Tf il j j j Aval | | j 

Tsel |Hpyl78III || j |NlaIIl|| j | Bpml Acil 
II Mill Ml I I I I 

TCTTGCTGCGTTTCAAGAATCTGGAGACCATGCTCGGGAATTTTCCCTACACCGCCCCCT 

2521 + + + + + + 2580 



AGAACGACGCAAAGTTCTTAGACCTCTGGTACGAGCCCTTAAAAGGGATGTGGCGGGGGA 

BsmAI 
BsmBI 
Aatll | 



BsaHI | j Hinfl Hhal 

Maell j | Tfil Thai | 

BslI | j j Sfcl Mnll|AciI | j Drdll MboII 

I I I I I II I I I I I 

AACGGACGTCTCCCTCCCTGTAGGAATCCGCGCTTCTTGGAAGAACCACCACCGAGTTCC 

2581 + + + + + + 2640 



TTGCCTGCAGAGGGAGGGACATCCTTAGGCGCGAAGAACCTTCTTGGTGGTGGCTCAAGG 



Hpyl78III 
Dpnl | 
BstYI | j 

CviJI Sau3AI | j 
BslI | Apol Alwl | j j 
Bfal | j Tsp509I Sfcl | III 
III I I I I I I 
CCTAGTCTGGCTCACAGAAATTTCCTATCGCTCTACTCTCTATAGGCAAGATCCTGAACT 
2641 + + + + + + 2700 



GGATCAGACCGAGTGTCTTTAAAGGATAGCGAGATGAGAGATATCCGTTCTAGGACTTGA 

Maelll 
Tsp45I 
MslI | 
Cjel | j 
Hgal | j j 
CviJI || || 
BsaJI BsaAI Hael | | j j 

Tsp509I Styl Maell | Hael I I || | j 

TaqI | Cjel CviJI | Rsal | j Cac8I | jj j | 

III II Ml I I II I I 

CCACTCGAAATTACTGATTAGCCAAGGTACGTGGACGACGCAGGCCACTCCTGTGACCTA 

2701 + + + + + + 2760 

GGTGAGCTTTAATGACTAATCGGTTCCATGCACCTGCTGCGTCCGGTGAGGACACTGGAT 
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Fig. 24 (con't) 

Aarl 
MslI | 

Dpnl BspMI CviRI | | Maelll 

Sau3AI | Alwl | Nlalll | j Tsp45I 

I I I I II I I 

CAATGCTTTAGGGATCAAAGTGAAAAATACCATGCAGGTGTTTCCTAAAGTCACTCTCTC 

2761 + + + + + + 2820 

GTTACGAAATCCCTAGTTTCACTTTTTATGGTACGTCCACAAAGGATTTCAGTGAGAGAG 



MboII 
Acil | 
i I 



BseMII 
Bsp24I | 
CjePI j 
Cjel| | 

I ! 



Maelll 
Tsp45I 
Hinf I | 
Mnll j 
Ddel | j 
' I I 



Cjel 
CjePI 
Bsp24I 
Mae I I | 
Msel | j 
lei 1 || 
I I II 



CTTAGATTACTCTGCGGATATTTCTTCCTCCACGCTGAGTCACTACTTAAACGTGGCGAG 

2821 + + + + + + 2880 

GAATCTAATGAGACGCCTATAAAGAAGGAGGTGCGACTCAGTGATGAATTTGCACCGCTC 



Mnll 
Plel [ 

Maelll BseRl| | 

Msel Tsp45I Msel NlalV Bfal j j j 

I I I I I I II 

TAGAATGAGATTTTAACAATAAGTGACCAAAACAGAAAGATTAAGGAACCTCTAGTGTCA 

2881 + + + + + + 2940 

ATCTTACTCTAAAATTGTTATTCACTGGTTTTGTCTTTCTAATTCCTTGGAGATCACAGT 



Apol 
Tsp509I 
Sthl32I Hpyl78III | 
Hinfl Ddel Mnll | Aval | j 

I ! II III 

AAGACTCCTCCTAAGTTTTTATTCTATCTCGGGAATTTCACAGCCTGCATGTTCGGGATG 

2941 + + + + + + 3000 

TTCTGAGGAGGATTCAAAAATAAGATAGAGCCCTTAAAGTGTCGGACGTACAAGCCCTAC 



Hpyl78III 

Nlalll | 

CviRI | j 

Cac8I | j j 

Sthl32I | j j 

CviJI | |NspI j 

II I I ! 
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Figure 25: 



cactgtggat gtgatattcg cagaacctcc cgtcaaatat actctagata taggaagcaa 60 

attacgactt taaacctcat ttaacgacag ggttgaggc atg cct ctt tct ttc 114 

Met Pro Leu Ser Phe 



aaa tct tea tct ttt tgt eta ctt gec tgt tta tgt agt gca agt tgc 
Lys Ser Ser Ser Phe Cys Leu Leu Ala Cys Leu Cys Ser Ala Ser Cys 



gcg ttt get gag act aga etc gga ggg aac ttt gtt cct cca att acg 
Ala Phe Ala Glu Thr Arg Leu Gly Gly Asn Phe Val Pro Pro lie Thr 



aat cag ggt gaa gag ate tta etc act tea gat ttt gtt tgt tea aac 
Asn Gin Gly Glu Glu lie Leu Leu Thr Ser Asp Phe Val Cys Ser Asn 



ttc ttg ggg gcg agt ttt tea agt tec ttt ate aac agt tec age aat 
Phe Leu Gly Ala Ser Phe Ser Ser Ser Phe lie Asn Ser Ser Ser Asn 



etc tec tta tta ggg aag ggc ctt tec tea acg ttt ace tct tgt 
Leu Ser Leu Leu Gly Lys Gly Leu Ser Leu Thr Phe Thr Ser Cys 



get cct aca aat agt aac tat gcg eta ctt tct gec gca gag act ctg 
Ala Pro Thr Asn Ser Asn Tyr Ala Leu Leu Ser Ala Ala Glu Thr Leu 
90 95 100 

acc ttc aag aat ttt tct tct ata aac ttt aca ggg aac caa teg aca 
Thr Phe Lys Asn Phe Ser Ser lie Asn Phe Thr Gly Asn Gin Ser Thr 
105 110 115 

gga ctt ggc ggc etc acc tac gga aaa gat att gtt ttc caa tct ate 
Gly Leu Gly Gly Leu lie Tyr Gly Lys Asp lie Val Phe Gin Ser lie 
120 125 130 

aaa gat ttg ate ttc act acg aac cgt gtt gec tat tct cca gca tct 
Lys Asp Leu lie Phe Thr Thr Asn Arg Val Ala Tyr Ser Pro Ala Ser 
135 140 145 

gta act acg teg gca act ccc gca ate act aca gta act aca gga gee 
Val Thr Thr Ser Ala Thr Pro Ala lie Thr Thr Val Thr Thr Gly Ala 
150 155 160 165 

tct get etc caa cct aca gac tea etc act gtc gaa aac ata tec caa 

Ser Ala Leu Gin Pro Thr Asp Ser Leu Thr Val Glu Asn lie Ser Gin 

Ser Ala Leu Gin Pro Thr Asd Ser Leu Thr Val Glu Asn lie Ser Gin 

170 . . 175 180 

teg ate aag ttt ttt ggg aac ctt gec aac ttc ggc tct gca att age 

Ser lie Lys Phe Phe Gly Asn Leu Ala Asn Phe Gly Ser Ala lie Ser 

Ser lie Lys Phe Phe Gly Asn Leu Ala Asn Phe Gly Ser Ala lie Ser 

185 190 195 
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Fig. 25 (con't) 

agt tct ccc acg gca gtc gtt aaa ttc ate aat aac acc get acc atg 
Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn Asn Thr Ala Thr Met 
Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn Asn Thr Ala Thr Met 

200 205 210 

age ttc tec cat aac ttt act teg tea gga g gc ggc gtg att tat gga 
Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly Gly Val He Tyr Gly 
Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly Gly Val He Tyr Glv 
215 220 225 

gga age tct etc ctt ttt gaa aac aat tct gga tgc ate ate ttc acc 

Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly Cys He lie Phe Th- 

Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly Cys He He Phe Thr 
230 235 _ ' 240 245 

gee aac tec tgt gtg aac age tta aaa ggc gtc acc cct tea tea gca 

Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val Thr Pro Se- Ser Glv 

Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val Thr Pro Ser Ser Gly 
250 255 260 

acc tat get tta gga agt ggc gga gec ate tgc ate cct acg gga ac- 

Thr Tyr Ala Leu Gly Ser Gly Gly Ala He Cvs He Pro Th- Glv Th- 

Thr Tyr Ala Leu Gly Ser Gly Gly Ala He Cvs He Pro Thr Gly Thr 
265 270 275 

ttc gaa tta aaa aac aat cag ggg aag tgc acc ttc tct tat aat ggc c 

Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr Phe Se- Tv- Asn CP v 

Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr Phe Ser Ty- ^sn Glv 
280 235 290 

aca cca aat gat gcg ggt gcg ate tac gee gaa acc tgc aac ate gta 1 

Thr Pro Asn Asp Ala Gly Ala He Tyr Ala Glu Thr Cys Asn Pe Val 

Thr Pro Asn Asp Ala Gly Ala He Tyr Ala Glu Thr Cys Asn lie Val 

295 300 305 

ggg aac cag ggt gec ttg etc eta gat age aac act gca gcg aga aat 1 

Gly Asn Gin Gly Ala Leu Leu Leu Asp Ser Asn Thr Ala Ala Arg Asn 

Gly Asn Gin Gxy Ala Leu Leu Leu Asp Ser Asn Thr Ala Ala Arg Asn 

310 315 320 325 

ggc gga gee ate tgt get aaa gtg etc aat att caa gga cgc ggt cct 1" 

Gly Gly Ala lie Cys Ala Lys Val Leu Asn He Gin Gly Arg Gly Pro 

Gly Gly Ala He Cys Ala Lys Val Leu Asn He Gin Gly Arg Gly P~o 

330 335 y 3 4 o 

tl C 5" aga aac Cg ° gcg gag aa 9 get att ttc ata 11 

^ U ^ S SSr ^ g Asn Ala Glu L ^ s G1 y Glv Ala He Phe He 

He Glu Phe Ser Arg Asn Arg Ala Glu Lys Gly Gly Ala He Phe He 
345 350 355 

ggc ccc tct gtt gga gac cct gcg aag caa aca teg aca ctt acg att 12 

c\l P r ° ^ ASP ?r ° Aia LyS Gin Thr S *= Thr Leu Th? He 

Gly Pro Ser Val Gly As? Pro Ala Lys Gin Thr Ser Thr Leu Thr H- 
360 - • 365 37 0 

ttg get tec gaa ggt gat att gcg ttc caa gga aac atg etc aat aca 17 

Leu Ala Ser Glu Gly Asp He Aia Phe Gin Sly Asn Me? Leu tin 

Leu Ala Ser Glu Gly Asp He Aia Phe Gin Gly Asn Met Leu Asn Thr 
3/5 380 3 8 5 
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aaa cct gga ate cgc aat gec ate act gta gaa gca ggg gga gag att 1314 

Lys Pro Gly lie Arg Asn Ala lie Thr Val Glu Ala Gly Gly Glu lie 

Lys Pro Gly lie Ar-g Asn Ala lie Thr Val Glu Ala Gly Gly Glu lie 

390 395 400 405 

gtg tct eta tct gca caa gga ggc tea cgt ctt gta ttt tat gat ccc 1362 

Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu Val Phe Tyr Asp Pro 

Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu Val Phe Tyr Asp Pro 
410 415 420 

att aca cat age etc cca acc aca agt ccg tct aat aaa gac act aca 14 10 

lie Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Asp lie Th- 

Ile Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Aso lie Thr 

425 430 435 

ate aac get aat ggc get tea gga tct gta gtc ttt aca agt aag gga 1453 

lie Asn Ala Asn Gly Ala Ser Gly Ser Val Val Phe Thr Ser Lys Gly 

He Asn Ala Asn Gly Ala Ser Gly Ser Val Val Phe Thr Ser Lys Glv 

440 445 450 

etc tec tct aca gaa etc ctg ttg cct gec aac acg aca act aca ctt 1506 

Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn Thr Thr Thr He Leu 

Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn Thr Thr Thr lie Leu 
455 460 465 

eta gga aca gtc aag ate get agt gga gaa ctg aag att act gac aat 1**4 

Leu Gly Thr Val Lys He Ala Ser Gly Glu Leu Lys lie Thr Aso Asn 

Leu Gly Thr Val Lys He Ala Ser Gly Glu Leu Lys He Thr £sd Asn 

470 475 480 " 485 

gcg gtt gtc aat gtt get ggc ttc get act cag ggc tea ggt cag ctt 1602 

Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Glv Gin Leu 

Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Gly Gin Leu 
490 495 500 

acc ctg ggc tct gga gga acc tta ggg ctg gca aca ccc acg gca gca 1650 

Thr Leu Gly Ser Gly Giy Thr Leu Gly Leu Aia Thr Pro Thr Gly Ala 

Thr Leu Gly Ser Gly Gly Thr Leu Gly Leu Ala Thr Pro Thr Glv Ala 
505 510 515 

cct gec get gta gac ttt acg att gga aag tta gca ttc gat cct ttt 1698 

Pro Ala Ala Val Asp Phe Thr lie Gly Lys Leu Ala Phe Asp 0-0 Phe 

Pro Ala Ala Val Asp Phe Thr He Gly Lys Leu Ala Phe Asd Pro Phe 
520 525 530 

tee ttc eta aaa aga gat ttt gtt tea gca tea gta aat gca ggc aca 174 5 

Ser Phe Leu Lys Arg Asp Phe Val Ser Ala Ser Val Asn Ala Gly Thr 

Ser Phe Leu Lys Arg Asp Phe Val Ser Ala Ser Val Asn Ala Giy Thr 

535 540 545 

aaa aac gtc act tta aca gga get ctg gtt ctt gat gaa cat gac gtt 17 9a 

Lys Asn Val Thr Leu Thr- Gly Ala Leu Val Leu Aso Glu His Aso Va' 

Lys Asn Val Thr Leu Thr Gly Aia Leu Val Leu Asp Glu His Aso Val 

550 555 560 " 565 

aca gar ctt tat gat atg gtg tea tta caa tct cca gta gca att cct 18 4 2 

Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala He n- Q 

Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala lie P-o 

570 575 530 
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Fig. 25 (con't) 

act caa aag age gag gec tta att tec tgg aaa gca get tat ggt tat 

Thr Gin Lys Ser Glu Ala Leu He Ser Trp Lys Ala Ala Tyr Gly Tyr 

Thr Gin Lys Ser Glu Ala Leu He Ser Trp Lys Ala Ala Tyr Gly Tvr 

775 780 785 

tec aaa aat cac eta aat ace ace tac etc aga cct gac aaa get cca 

Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg Pro Asd Lys Ala Pro 

Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg Pro Asp Lvs Ala P-o 

790 _ 795 800 " 805 

aaa tct caa ggg caa tgg cat aac aat agt tac tat gtt ctt att tct 

Lys Ser Gin Gly Gin Trp His Asn Asn Ser Tyr Tyr Val Leu He Ser 

Lys Ser Gin Gly Gin Trp His Asn Asn Ser Tyr Tyr Val Leu He Ser 

810 815 820 

gca gaa cat cct ttc eta aac tgg tgt ctt ctt aca aga cct ctg get 

Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 

Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 
825 830 835 

caa get tgg gat ctt tea ggt ttt att tec gca gaa ttc eta ggt ggt 

Gin Ala Trp Asp Leu Ser Gly Phe He Ser Ala Glu Phe Leu Gly Gly 

Gin Ala Trp Asp Leu Ser Gly Phe He Ser Ala Glu Phe Leu GW Gly 

840 845 850 

tgg caa agt aag ttc aca gaa act gga gat ctg caa cgt age ttt agt 

Trp Gin Ser Lys Phe Thr Glu- Thr Giy Asn Leu Gin Arg Ser Phe Ser 

Trp Gin Ser Lys Phe Thr Glu Thr Gly Asd Leu Gin Arg Ser Phe Se- 
855 860 " 865 

aga ggt aaa ggg tac aat gtt tec eta ccg ata gga tgt tet tct caa 

Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He Gly Cys Ser Ser Gin 

Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He Gly Cys Ser Ser Gin 
870 875 880 885 

tgg ttc aca cca ttt aag aag get cct tct aca ctg acc ate aaa ctt 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr lie Lys Leu 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr He Lys Leu 

. 890 895 900 

gee tac aag cct gat ate tat cgt gtc aac cct cac aat att gtg act 

Ala Tyr Lys Pro Asp lie Tyr Arg Vai Asn Pro His Asn He Va 1 Thr 

Ala Tyr Lys Pro Asp lie Tyr Arg Val Asn Pro His Asn He Val Thr 
905 910 915 

gtc gtc tea aac caa gag age act teg ate tea gga gca aat eta cgc 

Val Vai Ser Asn Gin Glu Ser Thr Ser He Ser Gly Ala Asn Leu Arg 

Val Val Ser Asn Gin Glu Ser Thr Ser He Ser Gly Ala Asn Leu Arc 

920 925 930 

cgc cac gg- ttg ttt gta caa ate cat gat gta gta gat etc ace gag 

Arg His Gly Leu Phe Vai Gin lie His Aso Vai Val Aso Leu Th- Glu 

Arg His Giy Leu Phe Vai Gin He His Asd Val Val Asn Leu Thr G"> u 

935 940 945 
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Fig. 25 (con't) 

gac act cag gcc ttt eta aac tat acc ttt gac ggg aaa aat gga ttt 2994 

Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp Gly Lys Asn Gly Phe 

Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp Gly Lys Asn Gly Phe 

950 955 960 965 



aca aac cac cga gtg tct aca gga eta aaa tec aca ttt taaaactcta 3043 

Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 

Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 

970 975 

agetctgett agagttttcr gr.agccccgg tegtcttaga atccT.cr.atc catcaccgaa 3103 

gaacttagca atgaaggeca agattctcac tctatgagaa ccccccc 3150 
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Figure 26 (RY-47) 

Restriction enzyme analysis of CPN1 00630 



Hpyl78III 
Mnll | 

TspRI Sthl32I Bfalj 

Taal | Fokl BscGI | Xbal | | Tsp509I 

II I I I III I 

CACTGTGGATGTGATATTCGCAGAACCTCCCGTCAAATATACTCTAGATATAGGAAGCAA 

! + + + + + + 60 

GTGACACCTACACTATAAGCGTCTTGGAGGGCAGTTTATATGAGATCTATATCCTTCGTT 

Nlalll 
Nspl 

Dral SphI Mnll 

Msel| Msel Mnll CacBI j MboII | 

II II II II 

ATTACGATTTTAAACCTTATTTAACGACAGGGTTGAGGCATGCCTCTTTCTTTCAAATCT 

61 + + + + + + 120 

TAATGCTAAAATTTGGAATAAATTGCTGTCCCAACTCCGTACGGAGAAAGAAAGTTTAGA 



BsmAI Hinfl 
Hhal | Mnll 
Thai j Bfal | 

BseMU | | Plel | j 
AccI CviRI Mwol | j |DdeI | | j 

I I II I I I I I I 

TCATCTTTTTGTCTACTTGCCTGTTTATGTAGTGCAAGTTGCGCGTTTGCTGAGACTAGA 

121 + + + + + + 180 

AGTAGAAAAACAGATGAACGGACAAATACATCACGTTCAACGCGCAAACGACTCTGATCT 



Hpyl8 8IX 
I 



Tsp509I 

I 



Hinfl 
Tf il 
Mnll | Earl 
I I 



Dpnl 
Bglll 
BstYI 
Sau3AI 
Eco57I | 
I 
I 



Hpyl8 8IX 
Tthlllll 
MboII 
HphI 



CTCGGAGGGAACTTTGTTCCTCCAATTACGAATCAGGGTGAAGAGATCTTACTCACTTCA 

181 + + + + + + 240 

GAGCCTCCCTTGAAACAAGGAGGTTAATGCTTAGTCCCACTTCTCTAGAATGAGTGAAGT 



GATTTTGTTTGTTCAAACTTCTTGGGGGCGAGTTTTTCAAGTTCCTTTATCAATAGTTCC 

241 + + + + + + 300 

CTAAAACAAACAAGTTTGAAGAACCCCCGCTCAAAAAGTTCAAGGAAATAGTTATCAAGG 
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CviJI 
Haelll 
Eco0109I | 
BslI Sau96I j 
I I I 



Acll 
Maell 
Msel | 
I I 



Alul 
CviJI 
Mnll | 

I I 



AGCAATCTCTCCTTATTAGGGAAGGGCCTTTCCTTAACGTTTACCTCTTGTCAAGCTCCT 
TCGTTAGAGAGGAATAATCCCTTCCCGGAAAGGAATTGCAAATGGAGAACAGTTCGAGGA 
Plel 

Acil | Apol 
Fnu4HI j Tsp5 09I 

Taul j Hpyl88IX MboII j 

Maelll Hhal BsmAI | | Hinfl |Hpyl78IIl|| 

I I III I I Ml 

ACAAATAGTAACT ATGCG CTACTTTCTG CCG C AGAGACTCTG AC CTT C AAGAATTTTTCT 
TGTTTATCATTGATACGCGATGAAAGACGGCGTCTCTGAGACTGGAAGTTCTTAAAAAGA 



Drdll 
NlalV 
Bsp24I | 
Cjel| 
CjePI | 
I I 



CviJI 
Haelll 
Fnu4HI | 
Taul | 
Acil | j 
II I 



Cjel 
Mnll 
CjePI | 
Bsp24I | j 

I I I 



TCTATAAACTTTACAGGGAACCAATCGACAGGACTTGGCGGCCTCATCTACGGAAAAGAT 
AGATATTTGAAATGTCCCTTGGTTAGCTGTCCTGAACCGCCGGAGTAGATGCCTTTTCTA 



Dpnl 
Sau3AI | 



| | Pflll08I 
I I 



Cje 
Bsbl 
Taal | 
Bpml | 



1 I 
I I 



ATTGTTTTCCAATCTATCAAAGATTTGATCTTCACTACGAACCGTGTTGCCTATTCTCCA 
TAACAAAAGGTTAGATAGTTTCTAAACTAGAAGTGATGCTTGGCACAACGGATAAGAGGT 



Mae 1 1 
SfaNI | 
Maelll | j 
AlwNI | | | 

II II 



Maelll 
Sf cl 
Faul 



Sthl32I 
Acil | 
Mwol | j 
Cjel || | 
I II I 



Taal Sfcl 
I I 



CviJI 
NlaIV| 
II 



GCATCTGTAACTACGTCGGCAACTCCCGCAATCACTACAGTAACTACAGGAGCCTCTGCT 



CGTAGACATTGATGCAGCCGTTGAGGGCGTTAGTGATGTCATTGATGTCCTCGGAGACGA 
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Mmel Dpnl 
Sfcl TspRI Sau3AI | 

Plel| Taql| Clal | | 

Mnll | Hinfl Taal || TaqI | | 

I I ! I IN Ml 

CTCCAACCTACAGACTCACTCACTGTCGAAAACATATCCCAATCGATCAAGTTTTTTGGG 

601 + + + + + + 660 

GAGGTTGGATGTCTGAGTGAGTGACAGCTTTTGTATAGGGTTAGCTAGTTCAAAAAACCC 

Beef I 

Tsp509I Apol | 

CviRI | BsaJI Tsp509I j 

NlalV CviJI | | BstDSI Msel | j 

I III I III 

AACCTTGCCAACTTCGGCTCTGCAATTAGCAGTTCTCCCACGGCAGTCGTTAAATTCATC 

661 + + + + + + 720 

TTGGAACGGTTGAAGCCGAGACGTTAATCGTCAAGAGGGTGCCGTCAGCAATTTAAGTAG 

BsaXI 

Alul Fnu4HI 
CviJI, Hpyl78III Taul 

Acil Nlalll ] Mnll [ Acil| Mnll 

I II II II I 

AATAACACCGCTACCATGAGCTTCTCCCATAACTTTACTTCGTCAGGAGGCGGCGTGATT 

721 + + + + + + 780 

TTATTGTGGCGATGGTACTCGAAGAGGGTATTGAAATGAAGCAGTCCTCCGCCGCACTAA 

Nsil 
HaeIV| 
Hin4I j 
HphI | 
CviRI | | 
MboII | j j Acil 
Hpyl78III | III Fokl | 
Alul Tsp509I | j j | j SfaNI j 

CviJI SfaNI ( I i I i |CjeI | j 

I II I I III I I I 

TATGGAGGAAGCTCTCTCCTTTTTGAAAACAATTCTGGATGCATCATCTTCACCGCCAAC 

781 + + + + + + 840 

ATACCTCCTTCGAGAGAGGAAAAACTTTTGTTAAGACCTACGTAGTAGAAGTGGCGGTTG 
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Fig. 26 (con't) 



Maelll 
Tsp45I 
BsaHI 
Cjel 
Mwol 
HphI 



Msel 
Alul | 
CviJI j 
Hgal ! | 
I I I 



NlalV 
Hpyl78III | 
BslI | j 
EcoNI | j j 
III' 



TCCTGTGTGAACAGCTTAAAAGGCGTCACCCCTTCATCAGGAACCTATGCTTTAGGAAGT 
AGGACACACTTGTCGAATTTTCCGCAGTGGGGAAGTAGTCCTTGGATACGAAATCCTTCA 



CviRI 
Sthl32I 
Bed 
CviJI 
NlalV 



Ecil | 
Acil( | 
Fokl I I I 

III 



BscGI 
Sf aNI | 
II 



Msel 
Tsp509I | 
NspV | j 
Taqi | | 
I I 



BseSI 
CviRI 
ApaLI | 
I I 



901 



GGCGGAGCCATCTGCATCCCTACGGGAACTTTCGAATTAAAAAACAATCAGGGGAAGTGC 
CCGCCTCGGTAGACGTAGGGATGCCCTTGAAAGCTTAATTTTTTGTTAGTCCCCTTCACG 



BsiHKAI 
Bspl286I 
I 



Faul 
Sthl32I | 
Rsal | j 
SfaNI | j 

I I 



Mwol 
Dpnl | 
Acil Sau3AI | | 
I I 



CviRI 
I 



ACCTTCTCTTATAATGGTACACCAAATGATGCGGGTGCGATCTACGCCGAAACCTGCAAC 
TGGAAGAGAATATTACCATGTGGTTTACTACGCCCACGCTAGATGCGGCTTTGGACGTTG 



NlalV 
BanI 
ScrFI 
BsaJI | 
Drdll | j 
Pf 111081 EcoRIlj j 
BspMI | NlalV | j | 
I II ' 



PstI 
TspRI 
Fnu4HI ' 
CviRI 
Tsel 
BtsI | 
Sfcl j 
Bsbl | | 
I I 



NlalV 
Ecil | 
Acil | | 
Bbvl | | | 
III I 



1021 



ATCGTAGGGAACCAGGGTGCCTTGCTCCTAGATAGCAACACTGCAGCGAGAAATGGCGGA 
TAGCATCCCTTGGTCCCACGGAACGAGGATCTATCGTTGTGACGTCGCTCTTTACCGCCT 
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Fig. 26 (con't) 



Bed 
CviJI | Mwol 

I I 



Sspl 
BsiHKAI | 
Bspl286I 1 



• ■ Avail 
Sau96I 
Acil | 
Thai | 

Tthllll 



| Hgal 

I I 



Hpyl78III 
Apol 
EcoRI 
Tsp509I Bfal 
Xbal | 
I I 



G C C ATCTGTG CTAAAGTG CTCAATATTCAAGG ACG CGGT CCTATTG AATTCTCTAG AAAC 
CGGTAGACACGATTTCACGAGTTATAAGTTCCTGCGCCAGGATAACTTAAGAGATCTTTG 



NlalV 
Cvi JI | 

Mmel Haelllj BslI 

Alul | Eco0109l|| Bsal | Mnll 
CviJI I Sau96I j j BsmAI j SimI 
II Ml I I I 

CGCGCGGAGAAGGGTGGAGCTATTTTCATAGGCCCCTCTGTTGGAGACCCTGCGAAGCAA 

1141 + h + + + + 1200 

GCGCGCCTCTTCCCACCTCGATAAAAGTATCCGGGGAGACAACCTCTGGGACGCTTCGTT 



Acil 
Hhal 
Thai 
Thai | 



Tthlllll 

I 



HpylSBIX 
CviJI | 
I I 



BsaJI 
Styl 
HphI | 
I I 



Nlalll 
Nspl 
Cjel | 
I i 



ACATCGACACTTACGATTTTGGCTTCCGAAGGTGATATTGCGTTCCAAGGAAACATGCTC 

1201 + + ■+• + + + 1260 

TGTAGCTGTGAATGCTAAAACCGAAGGCTTCCACTATAACGCAAGGTTCCTTTGTACGAG 



Hin4I 
Hin4I j 
BsaXI | | 
Bsgl| | | 
II I I 

AATACAAAACCTGGAATCCGCAATGCCATCACTGTAGAAGCAGGGGGAGAGATTGTGTCT 

1261 + + + + + + 1320 

TTATGTTTTGGACCTTAGGCGTTACGGTAGTGACATCTTCGTCCCCCTCTCTAACACAGA 



Hinf I 
Tfil 
ScrFI | 
EcoRII | j Acil 
1 I I I 



Cjel TspRI 
Bed | Taal | 
BsrDI j Sf cl | j 
II III 



Dpnl 

CviRI Maell Sau3AI | CviJI 

BsmAI Mnll CviJI | Alwl | | BsaXI | 

i I II III II 

CTATCTGCACAAGGAGGCTCACGTCTTGTATTTTATGATCCCATTACACATAGCCTCCCA 

1321 + + + + + + 1380 

GATAGACGTGTTCCTCCGAGTGCAGAACATAAAATACTAGGGTAATGTGTATCGGAGGGT 
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Fig. 26 (con't) 



Eco5 7I 
I 



HaelV 
Hin4I 
Sf cl 
AlwNI 
Dpnl 
BstYI 
Sau3AI 
Hpyl7 8III 
Haell | 
Hhal | | 
Mwol | j | 
III ! 



ACCACAAGTCCGTCTAATAAAGACATTACAATCAACGCTAATGGCGCTTCAGGATCTGTA 
TGGTGTTCAGGCAGATTATTTCTGTAATGTTAGTTGCGATTACCGCGAAGTCCTAGACAT 
BseRI BsmFI 

PlellHinfl Sfcl |MnlI Cac8I Bsbl 

III III II 

GTCTTTACAAGTAAGGGACTCTCCTCTACAGAACTCCTGTTGCCTGCCAACACGACAACT 

CAGAAATGTTCATTCCCTGAGAGGAGATGTCTTGAGGACAACGGACGGTTGTGCTGTTGA 



Dpnl 
Sau3AI 
Hpyl78III 
Taal | 
cjePi | | 
Bfal || || |BfaI 

I I I III I 



Eco57I 
Acil | 
CjePI MboII | | 
I III 



ATACTTCTAGGAACAGTCAAGATCGCTAGTGGAGAACTGAAGATTACTGACAATGCGGTT 

+ + + + + + : 

TATGAAGATCCTTGTCAGTTCTAGCGATCACCTCTTGACTTCTAATGACTGTTACGCCAA 

Hpyl78III 
Banll 
Bspl286I 
CviJI 
Mnll 
ScrFl 
BsaJI 
BsaJI | 
BseMII | 
EcoRII j 
Alul | | 

CviJI j j 



Mwol 
CviJI | 
Cac8I j | 
I 



Banll 
Bspl286I 
BpulOI | 
Ddel j 

Ddel CviJI | | BseMII | 
I Ml II 



I I 



GTCAATGTTGCTGGCTTCGCTACTCAGGGCTCAGGTCAGCTTACCCTGGGCTCTGGAGGA 
CAGTTACAACGACCGAAGCGATGAGTCCCGAGTCCAGTCGAATGGGACCCGAGACCTCCT 
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AccI 
BspMI 
Sfcl 
MspAlI 
Acil 
Fnu4HI 
Taul 
Mwol 
Taqll 
Aarl 
BsiHKAI 
Bspl286I 
BscGI 
BsaJI 
BstDSI 
Bsbl 
Bpml 
Sthl32I 
Bsu36I Cac8I | 
Ddel CviJI | j 
I Ml 

ACCTTAGGGCTGGCAACACCCACGGGAGCACCTGCCGCTGTAGACTTTACGATTGGAAAG 

1621 + + + + + + 1680 

TGGAATCCCGACCGTTGTGGGTGCCCTCGTGGACGGCGACATCTGAAATGCTAACCTTTC 



Dpnl 
Sau3AI | 

Taql| j Cac8I 
Alwl | j j CviRI | 

Bsmi | | | SfaNI | | 

I II I III 
TTAGCATTCGATCCTTTTTCCTTCCTAAAAAGAGATTTTGTTTCAGCATCAGTAAATGCA 

1681 + + + + + + 1740 

AATCGTAAGCTAGGAAAAAGGAAGGATTTTTCTCTAAAACAAAGTCGTAGTCATTTACGT 

Hpyl78III 
Drdll 
Alol 
Ppil 
Banll | 
BsaXI j 
BsiHKAI j 
Bspl286I j 
Maelll SacI j 

Tsp45I Alul | | 

Maell | Msel CviJI j j 

II I IN 



Dpnl 

Maelll Bglll | 
MaellJ BstYI j 
Nlalll | j Sau3AI j 
III II 



GGCACAAAAAACGTCACTTTAACAGGAGCTCTGGTTCTTGATGAACATGACGTTACAGAT 



CCGTGTTTTTTGCAGTGAAATTGTCCTCGAGACCAAGAACTACTTGTACTGCAATGTCTA 
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Fig. 26 (con't) 



Bpml 

HaelV Tsp509I 
Hin4I BsrI | 

I I I 

CTTTATGATATGGTGTCATTACAATCTCCAGTAGCAATTCCTATCGCTGTTTTCAAAGGA 

GAAATACTATACCACAGTAATGTTAGAGGTCATCGTTAAGGATAGCGACAAAAGTTTCCT 

CviJI 
Bgll 
Mwol 
Cjel | 
CviJI | | 
I I I 



Maelll 
Taal 
BsaXI | Ddel 



Bed 

Hpyl78III I 



Hinf I 
Hin4I | 
Plel | I 
I I 



GCAACCGTTACTAAGACAGGATTTCCTGATGGGGAGATTGCGACTCCAAGCCACTACGGC 

+ h h + + + 

CGTTGGCAATGATTCTGTCCTAAAGGACTACCCCTCTAACGCTGAGGTTCGGTGATGCCG 



Tsp509I 
Sthl32I 
BscGI 



BsaJI 
Styl 



BsmFI 
Avail 
Sau96I 
Bcefl | 
BsmFI | 
I 



Cjel 
Nlaiv 
Avail 
Nlalll 
Sau96I 
BslI | 
I I 



Acelll 
BsmFI 
BslI 
Bed 
EcoNI 
Hpyl7 8III 
AlwNI | 
Mnll I 
Alul | I 
CviJI | j 
II 



TACCAAGGAAAGTGGTCCTACACATGGTCCCGTCCCCTGTTAATTCCAGCTCCTGATGGA 

1921 + + + + + + 1980 

ATGGTTCCTTTCACCAGGATGTGTACCAGGGCAGGGGACAATTAAGGTCGAGGACTACCT 



NlalV 
Avail 
EcoO109I 
Psp5II 
Sau96I 
ScrFI | 
EcoRII j j 
Mnll | | j 
III I 



Bpml 
Hhal | 
Mnll | | 
Bfal | | | 
I III 



Hpyl8 8IX 
Apol | 
EcoRI j 
Tsp509I j 
I I 



GGATTTCCTGGAGGTCCCTCTCCTAGCGCAAATACTCTCTATGCTGTATGGAATTCAGAC 

1981 + + + + + + 2040 

C CTAAAGGACCTC CAGGG AG AGG ATCG CGTTT ATG AGAG ATACGAC ATACCTT AAG TCTG 
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BssSI 
MslI 



Sthl32I 
Maelll 
Hpyl78III 
Aval 
Dpnl 



BstYI 
Sau3AI 
Ddel | 
Alwl | | 

I t I 



Cjel 
BsaXI | 
Hin4I | 
Tsp509I | | 
i I I 



ACTCTCGTGCGTTCTACCTATATCTTAGATCCCGAGCGTTACGGAGAAATTGTCAGCAAC 

2041 + + + + + + 2100 

TGAGAGCACGCAAGATGGATATAGAATCTAGGGCTCGCAATGCCTCTTTAACAGTCGTTG 



Alul Hpyl88IX 
CviJI Ddel Cjel Bsml | Fokl 

I I I I I I 

AGCTTATGGATTTCCTTCTTAGGAAATCAGGCATTCTCTGATATTCTCCAAGATGTTCTT 

2101 + + + + + + 2160 

TCGAATACCTAAAGGAAGAATCCTTTAGTCCGTAAGAGACTATAAGAGGTTCTACAAGAA 



Dpnl 
HaeIV| 
Hin4I I 
Sau3AI | j Aval 
Sthl32I j |BsaJI 
ill II 



Sthl32I 
Neil 
ScrFI 
Sraal 
Mspl | 
Neil j 
ScrFI 



Alul BsaXI 

CviJI Hin4I 

Hindlll | CviJI | 

Thai j j NlaIV| j 

Acil | | j Mwol | j j TaqI 

I I I I I II I I 

TTGATAGATCATCCCGGGTTGTCCATAACCGCGAAAGCTTTAGGAGCCTATGTCGAACAC 

2161 + + + + + + 2220 

AACTATCTAGTAGGGCCCAACAGGTATTGGCGCTTTCGAAATCCTCGGATACAGCTTGTG 



Mnll CviJI 
BslI | Nlalll | 
II II 



CviJI 
Mwol | 
Mnll Bbvl | j 
I III 



Fnu4HI 

Alul | 
CviJI j 

Tsel |HhaI 
I I 1 



ACACCAAGACAAGGACATGAGGGCTTTTCAGGTCGCTATGGAGGCTACCAAGCTGCGCTA 



TGTGGTTCTGTTCCTGTACTCCCGAAAAGTCCAGCGATACCTCCGATGGTTCGACGCGAT 



Alul 
CviJI 
Fnu4HI | 
Tsel | j 
Cjel 1| j Bbvl 

I I I MM I 

TCTATGAACTACACGGACCACACTACGTTAGGACTTTCTTTCGGGCAGCTTTATGGAAAA 

2281 + + + + + + 2340 

AGATACTTGATGTGCCTGGTGTGATGCAATCCTGAAAGAAAGCCCGTCGAAATACCTTTT 



Avail 
Sau96I 
Cjel | 



Maell Sthl32I 
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Fig. 26 (con't) 

Hinf I 
Tfil HpylSSIX 
Pf 111081 |MaeII Ddel | BseMII 

Mill I 
ACTAACGCCAACCCCTACGATTCACGTTGCTCAGAACAAATGTATTTACTCTCGTTCTTT 

2341 + + + h + + 2400 

TGATTGCGGTTGGGGATGCTAAGTGCAACGAGTCTTGTTTACATAAATGAGAGCAAGAAA 



Hinf I 
Hpyl78III I 
Maelll) j 
Tsp45l| j 
Plel | | | 
I I I 



Mnll 
I 



ScrFI 
EcoRII 
Tsp509I 
Msel | 



CviJI 
Hael 
Haelll 
StuI 



Alul 
CviJI 
Fnu4HI | 
Tsel| 



GGTCAATTCCCTATCGTGACTCAAAAGAGCGAGGCCTTAATTTCCTGGAAAGCAGCTTAT 

2401 + + + + + + 2460 

CCAGTTAAGGGATAGCACTGAGTTTTCTCGCTCCGGAATTAAAGGACCTTTCGTCGAATA 



Alul 
CviJI 
BseMII | 

HphI Hpyl88IX Bce83I | j 

Bbvl | Ddel | Mnll | jj Smll 

II II 11 II I 

GGTTATTCCAAAAATCACCTAAATACCACCTACCTCAGACCTGACAAAGCTCCAAAATCT 

2461 + + -t- + + + 2520 

CCAATAAGGTTTTTAGTGGATTTATGGTGGATGGAGTCTGGACTGTTTCGAGGTTTTAGA 



BsrDI Maelll 



PstI 
CviRI | 
Fokl Sfcl | j Alol 
I I I I I I I 

CAAGGGCAATGGCATAACAATAGTTACTATGTTCTTATTTCTGCAGAACATCCTTTCCTA 

2521 + + + + + + 2580 

GTTCCCGTTACCGTATTGTTATCAATGATACAAGAATAAAGACGTCTTGTAGGAAAGGAT 



Dpnl 
BstYI 
Sau3AI 
Alul 
CviJI 
Hindlll [ 
Mnll | 

Bbsl Smll | j 

MboII BsrI Bce83I CviJI | j j | | Alwl Acil 

II I I I I I I I I I 

AACTGGTGTCTTCTTACAAGACCTCTGGCTCAAGCTTGGGATCTTTCAGGTTTTATTTCC 

2581 + + + + + + 2640 

TTGACCACAGAAGAATGTTCTGGAGACCGAGTTCGAACCCTAGAAAGTCCAAAATAAAGG 
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Bf al 
Avrll | 
BsaJI j 
Apol | I 

EcoRI | | 

Tsp509I Styl j 
I i 



Mae 1 1 
CviRI 
Dpnl 
Bglll 



Cjel 

I 



BsrI 
BstYI 
Sau3AI 
AlwNI | 
I 



Alul 
CviJI 
Cjel | 
I I 



GCAGAATTCCTAGGTGGTTGGCAAAGTAAGTTCACAGAAACTGGAGATCTGCAACGTAGC 

641 + + + + + + 

CGTCTTAAGGATCCACCAACCGTTTCATTCAAGTGTCTTTGACCTCTAGACGTTGCATCG 

Bpml Drdll 
Mnlll Rsal MboII Fokl | 

II I I II 

TTTAGTAGAGGTAAAGGGTACAATGTTTCCCTACCGATAGGATGTTCTTCTCAATGGTTC 

701 + + + + + + 

AAATCATCTCCATTTCCCATGTTACAAAGGGATGGCTATCCTACAAGAAGAGTTACCAAG 



NlalV Bed BsaBI 

Msel CviJI | TspRlj CviJI EcoRV | 

III II III 

ACACCATTTAAG AAGG CTCCTTCTAC ACTGACCATC AAACTTGCCTAC AAG C CTG ATATC 

2761 + + + + + + 2820 

TGTGGTAAATTCTTCCGAGGAAGATGTGACTGGTAGTTTGAACGGATGTTCGGACTATAG 



Hindi 
I 



Maelll 
Tsp45I 
Mnll | 
Sspl | | 
I II 



PshAI 
Taal | 
I I 



BsraAI 
BsraBI 
I 



Ddel 
Dpnl 
Sau3AI 
Taql| 

BsiHKAI | | 
Bspl286I j j 
I I 



TATCGTGTCAACCCTCACAATATTGTGACTGTCGTCTCAAACCAAGAGAGCACTTCGATC 



ATAGCACAGTTGGGAGTGTTATAACACTGACAGCAGAGTTTGGTTCTCTCGTGAAGCTAG 



BsaJI 
BstDSI 
Tthlllll 
Acil | 
Fnu4HI j 
BseMII | | 
Cjel |Taul| 
I 



Mnll 
Dpnl 
Bglll 
BstYI 
Sau3AI 



Rsal 

BsrGI | Cjel 
Taal TatI j 

I I I II I I II 

TCAGGAGCAAATCTACGCCGCCACGGTTTGTTTGTACAAATCCATGATGTAGTAGATCTC 

2881 + + + + + + 2940 

AGTCCTCGTTTAGATGCGGCGGTGCCAAACAAACATGTTTAGGTACTACATCATCTAGAG 



Nlalll HphI | 
I I I 
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CviJI 
Hael 
Haelll 
Ddel StuI 



Bsp24I 
CjePI 
Cjel| 

I I 



Sthl32I BscGI 
BsaJI Ddel StuI BseMII | BslI j 

I II II M 

ACCG AGG ACACT CAGG C C TTTCTAAACTATACCTTTGACGGG AAAAATGG ATTTAC AAAC 

2941 + + + + + + 3000 

TGGCTCCTGTGAGTCCGGAAAGATTTGATATGGAAACTGCCCTTTTTACCTAAATGTTTG 

Sfcl Cjel Alul 

AccI | CjePI Dral CviJI 

Drain |i Bsp24I | Msel| Ddel | Ddel 

III II II II I 

CACCGAGTGTCTACAGGACTAAAATCCACATTTTAAAACTCTAAGCTCTGCTTAGAGTTT 

3001 + + + + + + 3060 

GTGGCTCACAGATGTCCTGATTTTAGGTGTAAAATTTTGAGATTCGAGACGAATCTCAAA 

Hinf I 
Ddel 
Sthl32I 
BsiEI 



Mspl 
Neil 
ScrFI 
BsaJI | 
CviJI | j 
Sfcl | | | 

I II I 



|TfiI 
I 



Taql 
Mnll | 
Bed | j 

I I I 



CviJI 
Hael 
Haelll 
BsrDI | 
Mwol | j 
Ddel MboII | j | 
I II I 



TCTGTAGCCCCGGTCGTCTTAGAATCCTCTATCCATCATCGAAGAACTTAGCAATGAAGG 
AGACATCGGGGCCAGCAGAATCTTAGGAGATAGGTAGTAGCTTCTTGAATCGTTACTTCC 



Hin4I 
Hinf I | 
Tf il j 
I I 

CCAAGATTCTCACTCTATGAGAACCCCCCC 

3121 + + + 3150 

GGTTCTAAGAGTGAGATACTCTTGGGGGGG 
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Figure 27: CPN100397 

1 MKIPLRFLLI SLVPTLSMSN LLGAATTEEL SASNSFDGTT STTSFSSKTS 

51 SATDGTNYVF KDSWIENVP KTGETQSTSC FKNDAAAGDL NFLGGGFSFT 

101 FSNTDATTAS GAAIGSEAAN KTVTLSGFSA LSFLKS PAST VTNGLGAINV 

151 KGNLSLLDND KVLIQDNFST GDGGAINCAG SLKIANNKSL SFIGNSSSTR 

201 GGAIHTKNLT LSSGGETLFQ GNTAPTAAGK GGAIAIADSG TLSISGDSGD 

251 IIFEGNTIGA TGTVSHSAID LGTSAKITAL RAAQGHT I Y F YDPITVTGST 

3 01 SVADALNINS PDTGDNKEYT GTIVFSGEKL TEAEAKDEKN RTSKLLQNVA 
351 FKNGTWLKG DWLSANGFS QDANSKLIMD LGTSLVANTE SIELTNLEIN 

4 01 IDSLRNGKKI KliSAATAQKD IRIDRPWLA ISDESFYQNG FLNEDHSYDG 
451 ILELDAGKDI VISADSRSID AVQSPYGYQG KWTINWSTDD KKATVSWAKQ 
501 SFNPTAEQEA PL.VPNLLWGS FIDVRSFQNF IELGTEGAPY EKRFWVAGIS 
551 NVLHRSGREN QRKFRHVSGG AWGASTRMP GGDTLSLGFA QLFARDKDYF 
601 MMTNFAKTYA GSLRLQHDAS LYSWSILLG EGGLREILLP YVSKTLPCSF 
651 YGQLS YGHTD HRMKTESLPP PPPTLSTDHT SWGGYVWAGE LGTRVAVENT 
701 SGRGFFQEYT P F VKVOAVY A RQDSFVELGA ISRDFSDSHL YNLAIPLGIK 
751 LEKRFAEQYY HWAMYSPDV CRSNPKCTTT LLSNQGSWKT KGSNLARQAG 
801 IVQASGFRSL GAAAELFGNF GFEWRGSSRS YNVDAGSKIK F 



Possible T cell epitope: 
516 LLWGSFIDV 
Possible B cell epitope: 
554 HRSGRENQRKFRHV 
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Figure 28: 



CPN100421 



1 MPPLNADDVL PRDHLSDGSF SDTYPDITTQ AIILIFLALS P FL VMLLTS Y 

51 LKIIITLVLL RNALGVQQTP PSQVLNGIAL ILSIYVMFPT G VAM Y KD ARK 

101 EIEANTIPQS LFTAEGAETV FVALNKSKEP LRSFLIRNTP KAQIQSFYKI 

151 SQKTFPSEIR AHLTASDFVI IIPAFIMGQI KNAFEIGVLI YLPFFVIDLV 

201 TANVLVAMQM MMLSPLSISL PLKLLLIVMV DGWTLLLQGL MISFK 



Possible T cell epitope: 



18 8 VLIYLPFFV 



Possible B cell epitope: 
12 5 NKSKEPLR 
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Figure 29: CPN100422 

1 MKFFSLIFKD DDVSPNKKVL SPEAFSAFLD AKELLEKTKA DSEAYVAETE 

51 QKCAQIRQEA KDQGFKEGSE SWSKQIAFLE EETKNLRIRV REAJLVPLAIA 

101 SVRKIIGKEL ELHPETIVSI ISQALKELTQ NKHIIISVNP KDLPLVEKSR 

151 PELKNIVEYA DSLILTAKPD VTPGGCIIET EAGI1NAQLD VQLDAliEKAF 

2 01 STILKAKNPV DEPSETSSST DSSSLSNDQD KKE 



Possible T cell epitope: 
163 LILTAKPDV 

Possible B cell epitope: 
226 SNDQDKKE 
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1 MTLLCCTSCN 
51 AATEQMWDIA 
101 QEKIRYQEGL 
151 HRGVLDNPNS 
2 01 GLTEEIDYVS 
251 TLIMTMGGTK 
301 ETSDKDSSDK 



SRSLIVHGLP 
VPSAQITEAL 
SEQMASTIRK 
IMVSKIKRLI 
VWGIILAKSS 
GFFNPTPYTK 
DAPEGSNEIE 



GREANEIWL 
AILNQAGLPR 
MDGWDAS VQ 
ASAVPGLVPE 
LTKFRL I FYV 
NALEAKKAEG 
GA 



LVSKGVAAQK LPQAAAATAG 
MKGTSLLDLF AKQGLVPSEL 
ISFTTENEDN LPLTASVYIK 
NVSWSDRAA YSDITINGPW 
LILILFVISC GLLWVIWKTH 
AAADKEKKED ADSQGESKNA 



Possible T cell epitope: 
201 GLTEEIDYV 
Possible B cell epitope: 

2 84 DKEKKEDADSQGESKNAETSDKDSSDKDAPEGSNEIE 
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Figure 31: CPN100426 

1 MTIRVRNLAY SVNKKKILDG VTFSLERGHI TLFVGKSGSG KTMILRALAG 

51 LVQPTQGDIW IEGEAPALVF QQPELFSHMT VLGNCTHPQI HIKGRSTEEA 

101 REKAFELLHL LDIEEVAKNY PDQLSGGQKQ RVAIVRSLCM DKHTLLFDEP 

151 TSAliDPFATA SFRHLLETLR DQELTVGLTT HDMQFVHSCL DRIYLIDQGT 

201 VAGVYDKRDG ELDSGHPLSK YIHSAQ 



PCT/CA99/00992 



Possible T cell epitope: 
145 LLFDEPTSA 
Possible B cell epitope: 
2 05 YDKRDGE 
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Figure 32: CPN100508 

1 MKRPFFTYLC IIFYGSCASL 

51 DIDAVIYPAS MTKIATALFI 

101 GYRSPPHWLE TDGSTIQLHL 

151 GSVEKFMDKL NFFLKEEIGC 

2 01 LKEPPFRGVI STTSYKIGAT 
251 TGTTKTAGKN LIMAAEKNNR 

3 01 PLLRKELVPP SDCLQLEIAN 
351 AHADAFPIEQ GDLLGHWVFY 

4 01 RVFTSYRTYM SITMLLMYFR 



Possible T cell epitope: 
156 FMDKLNFFL 
Possible B cell epitope: 
4 22 RKHRKYKN 
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SLHAGLSFPE VRGATAAWH ADSGKVFYDK 

LKHYPTVIiDT LIKVKQDAIA SITPQAKKQS 

REELLGWDLF HAjLL VCS AND AANVLAMACC 

THTHFNNPHG LHHPNHYTTT RDLISIMRCA 

NLHGERILSP TNKLLLPGST YH Y P P AliGG K 

LLVTIATGYS GPVSDLYQDV . IAIjCETVFNE 

LGKLSCPIiPE GLYYDFYASE DREPLSVSFI 

DDEGKKISSQ PFYAPCRFER T I KP WKL YM K 

IRKHRKYKML KHYSKI 
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Figure 33: CPN10051S 

1 MASNPILQIE DLSITLAKQR QQYPIVQSLS FTTNEGQTLA IIGESGSGKS 

51 VSAHAILRLL PCPPFSVSGQ VNFQGHNLLT ASRSIQKKII GTEISMIFQN 

101 PQASLNPVFT IEQQFREIIH THLALTAEVA KEKMLYALEE TGFHDPRLCL 

151 NliYPHQLSGG MLQRICIAMA LLCSPKLLIA DEPTTALDVS VQYQILQLLK 

2 01 TLQKKTGMSL LIITHNMGW AETADDVLVL YAGRMVE CAP AVQMFHNPSH 
251 PYTRDLLASR PSLQPQQLGS FNPIPGQPPH YTAFPSGCRY HPRCSKILNR 

3 01 CSAEAPEIYP VREGHKVRVG CMTTNFPQPL IQATSLTKHY YKRS FWFQGK 
351 TIASRPVDDV SFSLYSRRAV GLIGESGSGK STLALALAGL LPLTSGFLTF 
401 NGTPIKLHSK HGRHQLRSQV RLVFQNPQAS LNPRKTILDS LGHSL.LYHKL 
451 VPKEKVLATV REYLELVGLS EEYFYRYPHQ LSGGQQQRVS IARALLGVPQ 
501 LIICDEIVSA LDLSIQAQIL NMLAELQKKL SLTYLFISHD LAWRSFCTE 
551 VFIMYKGQIV EKGNTKRIFS DPQHPYTRML LNAQLPETPD QRQSKPIFQE 
601 YHKDSEESCS TGCYFYNRCP QKQEACKSEI I PNQGDAHHT YRCIH 



Possible T cell epitope: 

5 9 LLPCPPFSV 

Possible B cell epitopes: 

18 KQRQQY 

5 87 ETPDQRQSK 



159/165 
SUBSTITUTE SHEET (RULE 26) 



Title: CHLAMYDIA ANTiGEN3AND 



WO 00/24765 



CORRESPONDING DNA FRAGMENTS 
AND USES THEREOF 
Inventor(s): Andrew D. MURDIN et al 
DOCKET NO.: 032931/0251 



PCT/CA99/00992 



Figure 34: CPN100538 

1 MPGIEKAATT VAVPQDKSEE EKVKERLTKR ELTCEDLKDN GYTVNFEDIS 

51 ILELLQFVSK ISGTNFVFDS NDLQFNVTIV SHDPTSVDDL STILLQVLKM 

101 HDLKWEQGN NVLIYRNPHL SKLSTWTDS SLKETCEAW VTRVFRLYRR 

151 QPSAAVNIIQ PLLSHDAIVS ASEATRHVII SDIAGNVDKV SDLLAALDCP 

201 GTSVDMTEYE VKYANPAALV SYCQDVLGTL AEDDAFQMFI QPGTNKIFW 

251 S S PRliANKAE QLL.KSLDVPE MAHTLDDPAS TALALGGTGT TSPKSLRFFM 

301 YKLKYQNGEV IANALQDIGY NL YVTT AMD E DFINTLNSIQ WLEVNNSIVI 

351 I GNQGNVDRV IGLLNGLDLP PKQVYIEVLI LDTSLEKSWD FGVQWVALGD 

401 EQSKVAYASG LLNNTGIATP TKATVPPGTP NPGSIPLPTP GQLTGFSDML 

451 NSSSAFGLGI IGNVLSHKGK SFLTLGGLLS ALDQDGDTVI VLNPRIMAQD 

501 TQQASFFVGQ TVPYQTIKYY IQETGTVTQN IDYEDIGVNL WTSTVAPNN 

551 WTliQIEQTI SELHSASGSL TPVTDKTYAA TRLQIPDGCF LVMSGHIRDK 

601 TTKWSGVPL LNSIPLIRGL FSRTIDQRQK RNIMMFIKPK VISSFEEGTR 

651 VTNKEGYRYN WEADEGSMQV APRHAP ECQG PPSLQAESDF KIIEIEAQ 



Possible T cell epitope: 

5 0 SILELLQFV 

Possible B cell epitopes: 

15 QDKSEEEK 
62 6 DQRQKRN 
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Figure 35: CPN100557 

1 MSRKDNEVS L ARSIFNILSG TFCSRITGIF REIAMATYFG ADPIVAAFWL 

51 GFRTVFFLRK ILGGLILEQA FIPHFEFLRA QSLDRAAFFF RRFSRLIKGS 

101 TIIFTLLIEA VLWVFFNNVE EGTYDMILLT MILLPCGIFL MMYNWGALL 

151 HCGNKFFGVG LAPWVNIIW IFFVIAARHS DPRERIIGLS VALVIGFFFE 

2 01 WLITVPGVWK FLLEAKSPPQ EHDSVRALLA PLSLGILTSS IFQLNLLSDI 
251 CliARYVHE I G PLYLMYSLKI YQLPIHLFGF GVFTVLLPAI SRCVQREDHE 

3 01 RGLKLMKFVL TLTMSVMIIM TAGLLLLALP GVRVLYEHGL FPQSAVYAIV 
351 RVLRGYGAS I IPMALAPLVS VLFYAQRQYA VPLFIGIGTA LANIVLSLVL 
401 GRWVLKDVSG ISYATSITAW VQLYFLWYYS SKRLPMYSKL LWESIRRSIK 
451 VMGTTMLACM ITLGLNILTQ TTYVIFLNPL TPLAWPLSSI TAQAIAFLSE 
501 SCIFLAFLFG FAKLLRVEDL INLASFEYWR GQRGLLQRQH VMQDTQN 



Possible T cell epitope: 

111 VLWVFFNNV 

Possible B cell epitopes: 

1 MSRKDNE 

2 95 QREDHERG 
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Figure 36: CPN100622 

1 MKTSRNKQCK ITDPLSKSSF FVGALILGKT TILLNATPLS DYFDNQANQL 

51 TTLFPLIDTL TNMTPYSHRA TLFGVRDDTN QD I VLDHQNS IESWFENFSQ 

101 DGGALSCKSL AITNTKNQIL FLNSFAIKRA GAMYVDGNFD LSENHGSIIF 

151 SGNLSFPNAS NFADTCTGGA VLCSKNVTIS KNQGTAYFIN NKAKSSGGAI 

2 01 QAAIINIKDN TGPCLFFNNA AGGTAGGALF ANACRIENNS QPIYFLNNQS 
251 GLGGAIRVHQ ECILTKNTGS VIFNNNFAME ADISANHSSG GAIYCISCSI 
'3 01 KDNPG I AAFD NNTAARDGGA ICTQSLTIQD SGPVYFTNNQ GTWGGAIMLR 

3 51 QDGACTLFAD QGDIIFYNNR HFKDTFSNHV SWCTRNVSL TVGASQGHSA 

4 01 TFYDPILQRY TIQNSIQKFN PNPEHLGTIL FSSTYIPDTS TSRDDFISHF 
451 RNHIGLYNGT LALEDRAEWK VYKFDQFGGT LRLGSRAVFS TTDEEQSSSS 
501 VGSVININNL AINLPSIL.GN RVAPKLWIRP TGSSAPYSED NNPIINLSGP 
551 LSLLDDENLD PYDTADLAQP IAEVPLLYLL DVTAKHINTD NFYPEGLNTT 
601 QHYGYQGVWS PYWIETITTS DTSSEDTVNT LHRQLYGDWT PTGYKVNPEN 
651 KGDIALSAFW QSFHNLFATL RYQTQQGQIA PTASGEATRL FVHQNSNNDA 
701 KGFHMEATGY SLGTTSNTAS NHSFGVNFSQ LFSNLYESHS DNSVASHTTT 
751 VALQINNPWL QERFSTSASL AYSYSNHHIK ASGYSGKIQT EGKCYSTTLG 
801 AAIiSCSLSLQ WRSRPLHFTP FIQAIAVRSN QTAFQESGDK ARKFSVHKPL 
851 YNLTVPLGIQ SAWESKFRLP TYWNIELAYQ PVLYQQNPEI NVSLESSGSS 
901 WLLSGTTLAR NAIAFKGRNQ IFIFPKLSVF LDYQGSVSSS TTTHYLHAGT 
951 TFKF 



Possible T cell epitope: 

119 ILFLNSFAI 

Possible B cell epitopes: 

2 KTSRNKQ 
64 7 NPENKG 
694 QNSNNDAK 
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Figure 37: CPN100626 

1 MQVFPKVTLS LDYSADISSS TLSHYLNVAS RMRFLTISDQ NRKIKEPLVS 
51 KTPPKFLFYL GNFTACMFGM TPAVYSLQTD SLEKFALERD EEFRTSFPLL 
101 DSLSTLTGFS PITTFVGNRH NSSQDIVLSN YKSIDNILLL WTSAGGAVSC 
151 NNFLLSNVED HAFFSKNLAI GTGGAIACQG ACT I TKNRG P LIFFSNRGLN 
201 NASTGGETRG GAIACNGDFT ISQNQGTFYF VNNSVNNWGG ALSTNGHCRI 
251 QSNRAPLLFF NNTAPSGGGA LRSENTTISD NTRPIYFKNN CGNNGGAIQT 
301 SVTVAIKNNS GSVIFNNNTA LSGSINSGNG SGGAIYTTNL, SIDDNPGTIL 
351 FNNNYCIRDG GAICTQFLTI KNSGHVYFTN NQGNWGGALM LLQDSTCLLF 
401 AEQGNIAFQN NEVFLTTFGR YNAIHCTPNS NLQLGANKGY TTAFFDPIEH 
451 QHPTTNPLIF NPNANHQGTI LFSSAYIPEA SDYENNFISS SKNTSELRNG 
501 VLSIEDRAGW QFYKFTQKGG I LKLGHAAS I ATTANSETPS TSVGSQVIIN 
551 NLAINLPSIL AKGKAPTLWI RPLQSSAPFT EDNNPTITL.S GPLTLLNEEN 
501 RDPYDSIDLS EPLQNIHLLS LSDVTARHIN TDNFHPESLN ATEHYGYQGI 
651 WSPYWVETIT TTNNASIETA NTLYRALYAN WTPLGYKVNP EYQGDLATTP 
701 LWQSFHTMFS LLRSYNRTGD SDIERPFLEI QGIADGLFVH QNSIPGAPGF 
751 RIQSTGYSLQ AS S ETSLHQK ISLGFAQFFT RTKEIGSSNN VSAHNTVSSL 
801 YVELPWFQEA FATSHSLAYG YGDHHLHAYI RHIKNRAEGT CYSHTLAAAI 
851 GCSFPWQQKS YLHLSPFVQA IAIRSHQTAF EEIGDNPRKF VSQKPFYNLT 
901 LPLGIQGKWQ SKFHVPTEWT LELSYQPVLY QQNPQIGVTL LASGGSWDIL 
951 GHNYVRNALG YKVHNQTALF RSLDLFLDYQ GSVSSSTSTH HLQAGSTLKF 



Possible T cell epitope: 

56 FLFYLGNFT 

Possible B cell epitopes: 

3 9 DQNRKIK 
5 97 NEENRDPYD 
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Figure 38: CPN100628 

1 MLLPFTFVLA NEGLQLPLET YITLSPEYQA APQVGFTHNQ NQDLAIVGNH 
51 NDFILDYKYY RSNG G ALTCK NLLISENIGN VFFEKNVCPN SGGAIYAAQN 
101 CTISKNQNYA FTTNLVSDNP TATAGSLLGG ALFAINCSIT NNLGQGTFVD 
151 NLALNKGGAL YTETNLSIKD NKGPIIIKQN RALNSDSLGG GIYSGNSLNI 
201 EGNSGAIQIT SNSSGSGGGI FSTQTLTISS NKKLIEISEN SAFANNYGSN 
251 FNPGGGGLTT TFCTILNNRE GVLFNNNQSQ SNGGAIHAKS IIIKENGPVY 
301 FLNNTATRGG ALLNLSAGSG NGSFILSADN GDIIFNNNTA SKHALNPPYR 
351 NAIHSTPNMN LQIGARPGYR VLFYDPIEHE LPSSFPILFN FETGHTGTVL 
401 FSGEHVHQNF TDEMNFFSYL, RNTSELRQGV LAVEDGAGLA CYKFFQRGGT 
451 LLLGQGAVIT TAGTIPTPSS TPTTVGSTIT LNHIAIDLPS ILSFQAQAPK 
501 IWIYPTKTGS TYTEDSNPTI TISGTLTLRN SNNEDPYDSL DLSHSLEKVP 
551 LLYIVDVAAQ KINSSQLDLS TLNSGEHYGY QGIWSTYWVE TTTITNPTSL 
601 LGANTKHKLL YANWSPLGYR PHPERRGEFI TNAL.WQSAYT ALAGLHSLSS 
651 WDEEKGHAAS LQGIGLLVHQ KDKNGFKGFR SHMTGYSATT EATSSQSPNF 
701 SLGFAQFFSK AKEHESQNST SSHHYFSGMC IAKYSLQRVI RLSVSLAYMF 
751 TSEHTHTMYQ GLLEGNSQGS FHNHTLAGAli SCVFLPQPHG ESLQIYPFIT 
801 ALAI RGNLAA FQESGDHARE FSLHRPLTDV SL.PVGIRASW KNHHRVPLVW 
851 LTEISYRSTL YRQDPELHSK LLISQGTWTT QATPVTYNAL G I KVKNTMQ V 
901 FPKVTLSLDY SADISSSTLS HYLNVASRMR F 



Possible T cell epitope: 

1 MLLPFTFVL 

Possible B cell epitopes: 

3 8 HNQNQ 

619 YRPHPERRG 

669 HQKDKNG 
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Figure 39: CPN100630 

1 MPLSFKSSSF CLLACLCSAS CAFAETRLGG NFVPPITNQG EEILLTSDFV^ 
51 CSNFLGASFS SSFINSSSNL SLLGKGLSLT FTSCQAPTNS NYALLSAAET \ 
101 LTFKNFSSIN FTGNQSTGLG GLIYGKDIVF QSIKDLIFTT NRVAYSPASV 
151 TTSATPAITT VTTGASALQP TDSLTVENIS QSIKFFGNLA NFGSAISSSP 
201 TAWKFINNT ATMSFSHKFT SSGGGVIYGG SSLLFENNSG CIIFTANSCV 
251 NSLKGVTPSS GTYALGSGGA ICIPTGTFEL KNNQGKCTFS YNGTPNDAGA 

3 01 IYAETCNIVG NQGALLLDSN TAARNGGAIC AKVLNIQGRG PIEFSRNRAE 
351 KGGAIFIGPS VGDPAKQTST LTILASEGDI AFQGNMLNTK PGIRNAITVE 

4 01 AGGEIVSLSA QGG S RLVFYD PITHSLPTTS PSNKDITINA NGASGSWFT 
451 SKGLSSTELL LPANTTTILL GTVKIASGEL KITDNAWNV AGFATQGSGQ 
501 LTLGSGGTLG LATPTGAPAA VDFTIGKLAF DPFSFLKRDF VSASVNAGTK 
551 NVTLTGALVL DEHDVTDLYD MVSLQSPVAI PIAVFKGATV TKTGFPDGEI 
601 ATPSHYGYQG KWSYTWSRPL LIPAPDGGFP GGPSPSANTL YAVWNSDTLV 
651 RSTYILDPER YGEIVSNSLW ISFLGNQAFS DILQDVLLID HPGLS ITAKA 
701 LGAYVEHTPR QGHEGFSGRY GGYQAALSMN YTDHTTLGLS FGQLYGKTNA 
751 NPYDSRCSEQ MYLLSFFGQF P I VTQKSEAL ISWKAAYGYS KNHLNTTYLR 
801 PDKAPKSQGQ WHNNSYYVLI SAEHPFLNWC LLTRPLAQAW DLSGFISAEF 
851 LGGWQSKFTE TGDLQRSFSR GKGYNVSLPI GCSSQWFTPF KKAPSTLTIK 
901 LAYKPDIYRV NPHNIVTWS NQESTSISGA NLRRHGLFVQ IHDWDLTED 
951 TQAFLNYTFD GKNGFTNHRV STGLKSTF 



Possible T cell epitope: 

93 6 GLFVQIHDV 

Possible B cell epitopes : 

281 KNNQGK 
34 5 SRNRAEK 
707 HTPRQGHE 



165/165 
SUBSTITUTE SHEET (RULE 26) 



RECE|> 

MAY 3 Oi 




IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

COMBINED DECLARATION AND POWER OF ATTORNEY TECH CENTER 

As a below named inventor, I hereby declare that: my residence, post office address and citizenship are as stated 
below next to my name; that I verily believe that I am the original, first and sole inventor (if only one name is 
listed below) or a joint inventor (if plural inventors are named below) of the subject matter which is claimed and 
for which a patent is sought on the invention entitled: 



CHLAMYDIA ANTIGENS AND CORRESPONDING DNA FRAGMENTS AND USES THEREOF 



vAthe specification of which 



□ 



is attached hereto. 

tvas filed on April 27, 2001 



as U.S. Application Serial No. 09/830,446 
was filed on October 28, 1999 



as PCT International Application No. 



PCT/CA99/00992 



and (if applicable) was amended o 



February 8, 2001 



I hereby state that I have reviewed and understand the contents of the above identified specification, including 
the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information known to me which is material to the examination of this 
application in accordance with Title 37, Code of Federal Regulations, §§ 1.56(a) and (b), which state: 

"(a) A patent by its very nature is affected with a public interest. The public interest is best served, 
and the most effective patent examination occurs when, at the time an application is being 
examined, the Office is aware of and evaluates the teachings of all information material to 
patentability. Each individual associated with the filing and prosecution of a patent application 
has a duty of candor and good faith in dealing with the Office, which includes a duty to disclose 
to the Office all information known to that individual to be material to patentability as defined in 
this section. The duty to disclose information exists with respect to each pending claim until the 
claim is cancelled or withdrawn from consideration, or the application becomes abandoned. 
Information material to the patentability that is cancelled or withdrawn from consideration need 
not be submitted if the information is not material to the patentability of any claim remaining 
under consideration in the application. There is no duty to submit information which is not 
material to the patentability of any existing claim. The duty to disclose all information known to 
be material to patentability is deemed to be satisfied if all information known to be material to 
patentability of any claim issued in a patent was cited by the Office or submitted to the Office in 
the manner prescribed by §§ 1 .97(b)-(d) and 1 .98. However, no patent will be granted on an 
application in connection with which fraud on the Office was practiced or attempted or the duty 
of disclosure was violated through bad faith or intentional misconduct. The Office encourages 
applicants to carefully examine: 

(1) prior art cited in search reports of a foreign patent office in a counterpart application, 

(2) the closest information over which individuals associated with the filing or prosecution of a 
patent application believe any pending claim patentably defines, to make sure that any 
material information contained therein is disclosed to the Office. 
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(b) Under this section, information is material to patentability when it is not cumulative to 
information already of record or being made of record in the application, and 

(1) It establishes, by itself or in combination with other information, a prima facie case of 
unpatentability of a claim; or 

(2) It refutes, or is inconsistent with, a position the applicant takes in; 

(i) Opposing an argument of unpatentability relied on by the Office, or 

(ii) Asserting an argument of patentability. 

A prima facie case of unpatentability is established when the information compels a conclusion that a 
claim is unpatentable under the preponderance of evidence, burden-of-proof standard, giving each term 
in the claim its broadest reasonable construction consistent with the specification, and before any 
consideration is given to evidence which may be submitted in an attempt to establish a contrary 
conclusion of patentability." 

I hereby claim foreign priority benefits under 3 5 United States Code ,§119 and/or § 3 65 of any foreign 
application^) for patent or inventor's certificate listed below and have also identified below any foreign 
application for patent or inventor's certificate filed by me or my assignee disclosing the subject matter claimed 
in this application and having a filing date (1) before that of the application on which priority is claimed, or (2) 
if no priority claimed, before the filing of this application: 

PRIOR FOREIGN APPLICATIONS 

Date First 

Filing Date Laid-open or Date Patented Priority 

Number Country (Dav/Month/Year) Published or Granted Claimed? 



I hereby claim the benefit under 35 United States Code, § 1 1 9(e) of any United States provisional application(s) 
listed below: 



Application Number 


Filing Date 


60/106,034 


October 28, 1998 


60/106,044 


October 28, 1998 


60/106,039 


October 28, 1998 


60/106,042 


October 28, 1998 


60/106,087 


October 29, 1998 


60/106,072 


October 29, 1998 


60/106,073 


October 29, 1998 


60/106,074 


October 29, 1998 


60/106,589 


November 2, 1998 


60/107,034 


November 2, 1998 


60/107,035 


November 2, 1998 


60/106.587 


November 2, 1998 


60/106,588 


November 2, 1998 



I hereby claim the benefit under Title 35, United States Code, §120 of any United States application(s) listed 
below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States application in the manner provided by the first paragraph of Title 35, United States Code, § 1 12, 1 
acknowledge the duty to disclose information which is material to patentability as defined in Title 37, Code of 
Federal Regulations, §1.5 6(a) which became available between the filing date of the prior application and the 
national or PCT international filing date of this application: 



PRIOR U.S. OR PCT APPLICATIONS 



Filing Date Status 
(day/month/year) (pending, abandoned, granted) 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on 
information and belief are believed to be true; and further that these statements were made with the knowledge 
that wilful false statements and the like so made are punishable by fine or imprisonment, or both, under Section 
1001 of Title 18 of the United States Code and that such wilful false statements may jeopardize the validity of 
the application or any patent issued thereon. 

I hereby appoint the following patent agents with full power of substitution, association and revocation to 
prosecute this application and/or international application and to transact all business in the Patent and 
Trademark Office connected therewith: 



HUGH O'GORMAN (Reg. No. 26140} 
A. DAVID MORROW (Reg. No. 28816) 
JAMES McGRAW (Reg. No. 28168 1 
JOHN BOCHNOVIC (Reg. No. 29229) 
JOY D. MORROW (Reg. No. 30911 ) 
TOKUO HIRAMA (Reg. No . 32551 ) 
KOHJI SUZUKI (Reg. No. 44467) 
CHRISTINE N. GENGE (Reg. No. 454051 
DENNIS S.K. LEUNG (Reg. No. 47325) 
MATTHEW M. ROY (Reg. No. 48.074 ) 
ELLIOTT S. SIMCOE (Reg. No. 50,010) 
DAVID E. SCHWARTZ (Reg. No. 48,211) 
STEPHEN A. BENT (Reg. No. 29,768 ) 
BETH BURROUS (Reg. No. 35,087) 
WILLIAM T. ELLIS (Reg. No. 26 ,874) 
MICHAEL D. KAMINSKI (Reg. No. 32 ,904) 
KENNETH E. KROSIN (Reg. No. 25,735 ) 
JACK LAHR (Reg. No. 19,621) 
PETER G. MACK (Reg. No. 26,001 ) 
BRIAN J. MCNAMARA (Reg. No. 32,789) 
RICHARD C. PEET (Reg. No. 35.792 1 
ANDREW E. RAWLINS (Reg. No. 34.702 1 
CHARLES F. SCHJLL (Reg. No. 27,590) 
MICHELE M. SJMKIN (Reg. No. 34,717 ) 



ALAN R. CAMPBELL (Reg. No. 26129) 
ROBERT D. GOULD (Reg. No. 27523) 
THOMAS R. KELLY (Reg. No. 29244) 
MICHAEL E. WHEELER (Reg. No. 29246) 
R. ALLAN BRETT (Reg. No. 40476) 
PHILIP D. LAI 1 IN (Reg. No. 44443) 
R. JOHN HALEY (Reg. No. 22502) 
THUY HUONG NGUYEN (Reg. No_4233-6) 
KEVIN K. GRAHAM (Reg. No. P-47 . 365) 
DAVID E. SCHWARTZ (Reg. No. 48.211) 
STEVEN B. GARLAND (Reg. No. 5D,QQ3.) 
ELLIOTT S. SIMCOE (Reg. No. ££0^010) 
DAVID A. BLUMENTHAL (Reg. No. 26.257) 
ALAN I. CANTOR (Reg. No. 28,163) 
JOHN J. FELDHAUS (Reg. No. 28,822) 
LYLE K. KIMMS (Reg. No. 34,079) 
JOHNNY A. KUMAR (Reg. No. 34,649) 
GLENN LAW (Reg. No. J4,321) 
STEPHEN B. MAEBIUS (35.264) 
SYBIL MELOY (Reg. No. 22.749 1 
GEORGE E. QUTLLIN (Reg. No. 32.792 1 
BERNHARD D. SAXE (Reg. No. 28,665) 
RICHARD L. SCHWABB (Reg. No. 25.4791 
HAROLD C. WEGNER (Reg. No. 25. 258) 



PLEASE SEND CORRESPONDENCE TO: BERNHARD D. SAXE 
FOLEY & LARDNER 
3000 K Street N.W . 
Suite 500 

Washington. D.C. 20007-5109 
U.S.A. 

Telephone: (202) 672-5300 
Facsimile: (202) 672-5399 
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1) INVENTOR'S SIGNATURE: 



Inventor's Name: 



ANDREW, 




Date: 2L_ 



(First) 

Country of Citizenship: GREAT BRITAIN 
Residence: .Ttictrmnnri Hill . Ontario, C 



(Family Name) 



(City, Province, Country) 
Post Office Address: 1 1 forest Hill Drive, Richmond Hill. Ontario L4B 3C2, Canada 



2) INVENTOR'S SIGNATURE: 



Inventor's Name: 



RAYMOND 



Date: 2.2- t-l-fy ^<^ig-Z— 



(First) 

Country of Citizenship: CANADA 



Residence: Aurora, Ontario. Canada 



(City, Province, Coumry) 
Post Office Address: 29 Kennedy St. W., Aurora. Ontario LOG 1T0, Canada 



" 3) INVENTOR' S SIGNATURE: 
Inventor's Name: 



(First) ^ (Mi&MO 



WANG 



Country of Citizenship: CANADA 

Residence: Toronto. Ontario, Canada <= ~X- 

(City, Province, Country) 

Post Office Address: 5 1 Aspenwood Drive, Toronto, Ontario M2H 2ES, Canada 
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SEQUENCE LISTING 



<110> MURDIN, Andrew D . ; OOMEN, Raymond P; and WANG, Joe 

<12 0> Chlamydia antigens and corresponding DNA fragments and uses thereof 

10 <130> 77813-51 

<140> PCT/CA99/00992 
<141> 1999-10-28 

<150> US 60/106034 
<151> 1998-10-28 

<150> US 60/106039 

<151> 1998-10-28 

20 

<150> US 60/10604 

<151> 1998-10-28 

<150> US 60/106044 
<151> 1998-10-28 

<150> US 60/106072 
<151> 1998-10-29 

30 <150> US 60/106073 
<151> 1998-10-29 

<150> US 60/106074 
<151> 1998-10-29 

<150> US 60/106087 
<151> 1998-10-29 

<150> US 60/106587 
40 <151> 1998-11-02 

<150> US 60/106588 
<151> 1998-11-02 

<150> US 60/107089 
<151> 1998-11-02 

<150> US 60/107034 

<151> 1998-11-02 

50 

<150> US 60/107035 

<151> 1998-11-02 

<160> 45 

<170> Patentln Ver. 2.0 



<210> 1 
60 <211> 2750 
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<212> DNA 

<213> Chlamydia pneumoniae 

<220> 
<221> CDS 

<222> (101) . . (2623 ) 



10 <400> 1 

attttaacgt gcgtatcatt tgtgactaag agatagactt gctttcttta tctatcttct 60 



gtattggaaa gaaagcccct tgagggaaaa aaaggttgtt atg aag att cca etc 115 

Met Lys lie Pro Leu 
1 5 



cgc ttt tta ttg ata tea tta gta 
Arg Phe Leu Leu lie Ser Leu Val 
10 

20 

tta gga get get act ace gaa gag 
Leu Gly Ala Ala Thr Thr Glu Glu 
25 



cct acg ctt tct atg teg aat tta 163 
Pro Thr Leu Ser Met Ser Asn Leu 
15 20 

tta teg get age aat age ttc gat 211 
Leu Ser Ala Ser Asn Ser Phe Asp 
30 35 



gga act aca tea aca aca age ttt tct agt aaa aca tea teg get aca 259 
Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys Thr Ser Ser Ala Thr 
40 45 50 



gat ggc acc aat tat gtt ttt aaa gat tct gta gtt ata gaa aat gta 307 
3 0 Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val Val lie Glu Asn Val 
55 60 65 



ccc aaa aca ggg gaa act cag tct act agt tgt ttt aaa aat gac get 355 
Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys Phe Lys Asn Asp Ala 
70 75 80 85 



gca get gga gat eta aat ttc tta gga ggg gga ttt tct ttc aca ttt 403 

Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly Phe Ser Phe Thr Phe 
90 95 100 

40 

age aat ate gat gca acc acg get tct gga get get att gga agt gaa 451 

Ser Asn lie Asp Ala Thr Thr Ala Ser Gly Ala Ala lie Gly Ser Glu 
105 110 115 



gca get aat aag aca gtc acg tta tea gga ttt teg gca ctt tct ttt 499 
Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe Ser Ala Leu Ser Phe 
120 125 130 



ctt aaa tec cca gca agt aca gtg act aat gga ttg gga get ate aat 547 
5 0 Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly Leu Gly Ala lie Asn 
135 140 145 



gtt aaa ggg aat tta age eta ttg gat aat gat aag gta ttg att cag 595 
Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp Lys Val Leu lie Gin 
150 155 160 165 



60 



gac aat ttc tea aca gga gat ggc gga gca att aat tgt gca ggc tec 
Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala lie Asn Cys Ala Gly Ser 
170 175 180 
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ttg aag ate gca aac aat aag tec ctt tct ttt att gga aat agt tct 
Leu Lys lie Ala Asn Asn Lys Ser Leu Ser Phe lie Gly Asn Ser Ser 
185 190 195 

tea aca cgt ggc gga gcg att cat acc aaa aac etc aca eta tct tct 
Ser Thr Arg Gly Gly Ala lie His Thr Lys Asn Leu Thr Leu Ser Ser 
200 205 210 

ggt ggg gaa act eta ttt cag ggg aat aca gcg cct acg get get ggt 
Gly Gly Glu Thr Leu Phe Gin Gly Asn Thr Ala Pro Thr Ala Ala Gly 
215 220 225 

aaa gga ggt get ate gcg att gca gac tct ggc acc eta tec att tct 
Lys Gly Gly Ala lie Ala lie Ala Asp Ser Gly Thr Leu Ser lie Ser 
230 235 240 245 

gga gac agt ggc gac att ate ttt gaa ggc aat acg ata gga get aca 
Gly Asp Ser Gly Asp lie lie Phe Glu Gly Asn Thr lie Gly Ala Thr 
250 255 260 

gga acc gtc tct cat agt get att gat tta gga act age get aag ata 
Gly Thr Val Ser His Ser Ala lie Asp Leu Gly Thr Ser Ala Lys lie 
265 270 275 

act gcg tta cgt get gcg caa gga cat acg ata tac ttt tat gat ccg 
Thr Ala Leu Arg Ala Ala Gin Gly His Thr lie Tyr Phe Tyr Asp Pro 
280 285 290 

att act gta aca gga teg aca tct gtt get gat get etc aat att aat 
lie Thr Val Thr Gly Ser Thr Ser Val Ala Asp Ala Leu Asn lie Asn 
295 300 305 

age cct gat act gga gat aac aaa gag tat acg gga acc ata gtc ttt 
Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr Gly Thr lie Val Phe 
310 315 320 325 

tct gga gag aag etc acg gag gca gaa get aaa gat gag aag aac cgc 
Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys Asp Glu Lys Asn Arg 
330 335 340 

act tct aaa tta ctt caa aat gtt get ttt aaa aat ggg act gta gtt 
Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys Asn Gly Thr Val Val 
345 350 355 

tta aaa ggt gat gtc gtt tta agt gcg aac ggt ttc tct cag gat gca 
Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly Phe Ser Gin Asp Ala 
360 365 370 

aac tct aag ttg att atg gat tta ggg acg teg ttg gtt gca aac acc 
Asn Ser Lys Leu lie Met Asp Leu Gly Thr Ser Leu Val Ala Asn Thr 
375 380 385 

gaa agt ate gag tta acg aat ttg gaa att aat ata gac tct etc agg 
Glu Ser lie Glu Leu Thr Asn Leu Glu lie Asn lie Asp Ser Leu Arg 
390 395 400 405 

aac ggg aaa aag ata aaa etc agt get gee aca get cag aaa gat att 
Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr Ala Gin Lys Asp lie 
410 415 420 
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cgt ata gat cgt cct gtt gta ctg gca att age gat gag agt ttt tat 
Arg lie Asp Arg Pro Val Val Leu Ala lie Ser Asp Glu Ser Phe Tyr 
425 430 435 

caa aat ggc ttt ttg aat gag gac cat tec tat gat ggg att ctt gag 
Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr Asp Gly lie Leu Glu 
440 445 450 

tta gat get ggg aaa gac ate gtg att tct gca gat tct cgc agt ata 
Leu Asp Ala Gly Lys Asp lie Val lie Ser Ala Asp Ser Arg Ser lie 
455 460 465 

gat get gta caa tct ccg tat ggc tat cag gga aag tgg acg ate aat 
Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly Lys Trp Thr lie Asn 
470 475 480 485 

tgg tct act gat gat aag aaa get acg gtt tct tgg gcg aag cag agt 
Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser Trp Ala Lys Gin Ser 
490 495 500 

ttt aat ccc act get gag cag gag get ccg tta gtt cct aat ctt ctt 
Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu Val Pro Asn Leu Leu 
505 510 515 

fc 99 99t tct ttt ata gat gtt cgt tec ttc cag aat ttt ata gag eta 
Trp Gly Ser Phe lie Asp Val Arg Ser Phe Gin Asn Phe lie Glu Leu 
520 525 530 

ggt act gaa ggt get cct tac gaa aag aga ttt tgg gtt gca ggc att 
Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe Trp Val Ala Gly lie 
535 540 545 

tec aat gtt ttg cat agg age ggt cgt gaa aat caa agg aaa ttc cgt 
Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn Gin Arg Lys Phe Arg 
550 555 560 565 

cat gtg agt gga ggt get gta gta ggt get age acg agg atg ccg ggt 
His Val Ser Gly Gly Ala Val Val Gly Ala Ser Thr Arg Met Pro Gly 
570 575 580 

ggt gat acc ttg tct ctg ggt ttt get cag etc ttt gcg cgt gac aaa 
Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu Phe Ala Arg Asp Lys 
585 590 595 

gac tac ttt atg aat acc aat ttc gca aag acc tac gca gga tct tta 
Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr Tyr Ala Gly Ser Leu 
600 605 610 

cgt ttg cag cac gat get tec eta tac tct gtg gtg agt ate ctt tta 
Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val Val Ser lie Leu Leu 
615 620 625 

gga gag gga gga etc cgc gag ate ctg ttg cct tat gtt tec aag act 
Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro Tyr Val Ser Lys Thr 
630 635 640 645 

ctg ccg tgc tct ttc tat ggg cag ctt age tac ggc cat acg gat cat 
Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr Gly His Thr Asp His 
650 655 660 
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cgc atg aag acc gag tct eta ccc ccc ccc ccc ccg acg etc teg acg 2131 
Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro Pro Thr Leu Ser Thr 
665 670 675 

gat cat act tct tgg gga gga tat gtc tgg get gga gag ctg gga act 217 9 
Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala Gly Glu Leu Gly Thr 
680 685 690 

cga gtt get gtt gaa aat acc age ggc aga gga ttt ttc caa gag tac 2227 
Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly Phe Phe Gin Glu Tyr 
695 700 705 

act cca ttt gta aaa gtc caa get gtt tac get cgc caa gat age ttt 2275 
Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ala Arg Gin Asp Ser Phe 
710 715 720 725 

gta gaa eta gga get ate agt cgt gat ttt agt gat teg cat ctt tat 2323 
Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser Asp Ser His Leu Tyr 
730 735 740 

aac ctt gcg att cct ctt gga ate aag tta gag aaa egg ttt gca gag 2371 
Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu Lys Arg Phe Ala Glu 
745 750 755 

caa tat tat cat gtt gta gcg atg tat tct cca gat gtt tgt cgt agt 2419 
Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro Asp Val Cys Arg Ser 
760 765 770 

aac ccc aaa tgt acg act acc eta ctt tec aac caa ggg agt tgg aag 2467 
Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn Gin Gly Ser Trp Lys 
775 780 785 

acc aaa ggt teg aac tta gca aga cag get ggt att gtt cag gee tea 2515 
Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly lie Val Gin Ala Ser 
790 795 800 805 

ggt ttt cga tct ttg gga get gca gca gag ctt ttc ggg aac ttt ggc 2563 
Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu Phe Gly Asn Phe Gly 
810 815 820 

ttt gaa tgg egg gga tct tct cgt age tat aat gta gat gcg ggt age 2611 
Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn Val Asp Ala Gly Ser 
825 830 835 

aaa ate aaa ttt tagegattte tctttcgatg ctatttttcc atggctattt 2663 
Lys lie Lys Phe 
840 

ttaaaatgat agccatggtt atagatacgt agtccttatt tcaaagaaga cactgttgca 2723 
ttagataege tctctgatcc ctcaaaa 2750 



<210> 2 
<211> 2523 
<212> DNA 

<213> Chlamydia pneumoniae 



60 <220> 

<221> CDS 
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atg aag att cca etc 
Met Lys lie Pro Leu 



cgc ttt tta ttg ata tea tta gta cct acg ctt tct atg teg aat tta 
Arg Phe Leu Leu lie Ser Leu Val Pro Thr Leu Ser Met Ser Asn Leu 



tta gga get get act ace gaa gag tta teg get age aat age ttc gat 
Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala Ser Asn Ser Phe Asp 



gga act aca tea aca aca age ttt tct agt aaa aca tea teg get aca 

Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys Thr Ser Ser Ala Thr 

40 45 50 

2 0 gat ggc ace aat tat gtt ttt aaa gat tct gta gtt ata gaa aat gta 

Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val Val lie Glu Asn Val 



ccc aaa aca ggg gaa act cag tct act agt tgt ttt aaa aat gac get 
Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys Phe Lys Asn Asp Ala 



gca get gga gat eta aat ttc tta gga ggg gga ttt tct ttc aca ttt 

Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly Phe Ser Phe Thr Phe 

30 90 95 100 

age aat ate gat gca ace acg get tct gga get get att gga agt gaa 

Ser Asn lie Asp Ala Thr Thr Ala Ser Gly Ala Ala lie Gly Ser Glu 

105 110 115 

gca get aat aag aca gtc acg tta tea gga ttt teg gca ctt tct ttt 

Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe Ser Ala Leu Ser Phe 

120 125 130 

4 0 ctt aaa tec cca gca agt aca gtg act aat gga ttg gga get ate aat 

Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly Leu Gly Ala lie Asn 

135 140 145 

gtt aaa ggg aat tta age eta ttg gat aat gat aag gta ttg att cag 

Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp Lys Val Leu lie Gin 

150 155 160 165 

gac aat ttc tea aca gga gat ggc gga gca att aat tgt gca ggc tec 

Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala lie Asn Cys Ala Gly Ser 

50 170 175 180 

ttg aag ate gca aac aat aag tec ctt tct ttt att gga aat agt tct 

Leu Lys lie Ala Asn Asn Lys Ser Leu Ser Phe lie Gly Asn Ser Ser 

185 190 195 

tea aca cgt ggc gga gcg att cat ace aaa aac etc aca eta tct tct 

Ser Thr Arg Gly Gly Ala lie His Thr Lys Asn Leu Thr Leu Ser Ser 

200 205 210 



60 
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ggt ggg gaa act eta ttt cag ggg aat aca gcg cct acg get get ggt 
Gly Gly Glu Thr Leu Phe Gin Gly Asn Thr Ala Pro Thr Ala Ala Gly 
215 220 225 

aaa gga ggt get ate gcg att gca gac tct ggc acc eta tec att tct 
Lys Gly Gly Ala lie Ala lie Ala Asp Ser Gly Thr Leu Ser lie Ser 
230 235 240 245 

gga gac agt ggc gac att ate ttt gaa ggc aat acg ata gga get aca 
Gly Asp Ser Gly Asp lie lie Phe Glu Gly Asn Thr lie Gly Ala Thr 
250 255 260 

gga acc gtc tct cat agt get att gat tta gga act age get aag ata 
Gly Thr Val Ser His Ser Ala lie Asp Leu Gly Thr Ser Ala Lys lie 
265 270 275 

act gcg tta cgt get gcg caa gga cat acg ata tac ttt tat gat ccg 
Thr Ala Leu Arg Ala Ala Gin Gly His Thr lie Tyr Phe Tyr Asp Pro 
280 285 290 

att act gta aca gga teg aca tct gtt get gat get etc aat att aat 
lie Thr Val Thr Gly Ser Thr Ser Val Ala Asp Ala Leu Asn lie Asn 
295 300 305 

age cct gat act gga gat aac aaa gag tat acg gga acc ata gtc ttt 
Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr Gly Thr lie Val Phe 
310 315 320 325 

tct gga gag aag etc acg gag gca gaa get aaa gat gag aag aac cgc 
Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys Asp Glu Lys Asn Arg 
330 335 340 

act tct aaa tta ctt caa aat gtt get ttt aaa aat ggg act gta gtt 
Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys Asn Gly Thr Val Val 
345 350 355 

tta aaa ggt gat gtc gtt tta agt gcg aac ggt ttc tct cag gat gca 
Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly Phe Ser Gin Asp Ala 
360 365 370 

aac tct aag ttg att atg gat tta ggg acg teg ttg gtt gca aac acc 
Asn Ser Lys Leu lie Met Asp Leu Gly Thr Ser Leu Val Ala Asn Thr 
375 380 385 

gaa agt ate gag tta acg aat ttg gaa att aat ata gac tct etc agg 
Glu Ser He Glu Leu Thr Asn Leu Glu He Asn He Asp Ser Leu Arg 
390 395 400 405 

aac ggg aaa aag ata aaa etc agt get gec aca get cag aaa gat att 
Asn Gly Lys Lys He Lys Leu Ser Ala Ala Thr Ala Gin Lys Asp He 
410 415 420 

cgt ata gat cgt cct gtt gta ctg gca att age gat gag agt ttt tat 
Arg He Asp Arg Pro Val Val Leu Ala He Ser Asp Glu Ser Phe Tyr 
425 430 435 

caa aat ggc ttt ttg aat gag gac cat tec tat gat ggg att ctt gag 
Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr Asp Gly He Leu Glu 
440 445 450 
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tta gat get ggg aaa gac ate gtg att tct gca gat tct cgc agt ata 1407 
Leu Asp Ala Gly Lys Asp lie Val lie Ser Ala Asp Ser Arg Ser lie 
455 460 465 

gat get gta caa tct ccg tat ggc tat cag gga aag tgg acg ate aat 1455 
Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly Lys Trp Thr lie Asn 
470 475 480 485 

tgg tct act gat gat aag aaa get acg gtt tct tgg gcg aag cag agt 1503 
Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser Trp Ala Lys Gin Ser 
490 495 500 

ttt aat ccc act get gag cag gag get ccg tta gtt cct aat ctt ctt 1551 
Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu Val Pro Asn Leu Leu 
505 510 515 

tgg ggt tct ttt ata gat gtt cgt tec ttc cag aat ttt ata gag eta 1599 
Trp Gly Ser Phe lie Asp Val Arg Ser Phe Gin Asn Phe lie Glu Leu 
520 525 530 

ggt act gaa ggt get cct tac gaa aag aga ttt tgg gtt gca ggc att 1647 
Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe Trp Val Ala Gly lie 
535 540 545 

tec aat gtt ttg cat agg age ggt cgt gaa aat caa agg aaa ttc cgt 1695 
Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn Gin Arg Lys Phe Arg 
550 555 560 565 

cat gtg agt gga ggt get gta gta ggt get age acg agg atg ccg ggt 1743 
His Val Ser Gly Gly Ala Val Val Gly Ala Ser Thr Arg Met Pro Gly 
570 575 580 

ggt gat ace ttg tct ctg ggt ttt get cag etc ttt gcg cgt gac aaa 1791 
Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu Phe Ala Arg Asp Lys 
585 590 595 

gac tac ttt atg aat acc aat ttc gca aag ace tac gca gga tct tta 1839 
Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr Tyr Ala Gly Ser Leu 
600 605 610 

cgt ttg cag cac gat get tec eta tac tct gtg gtg agt ate ctt tta 1887 
Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val Val Ser lie Leu Leu 
615 620 625 

gga gag gga gga etc cgc gag ate ctg ttg cct tat gtt tec aag act 1935 
Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro Tyr Val Ser Lys Thr 
630 635 640 645 

ctg ccg tgc tct ttc tat ggg cag ctt age tac ggc cat acg gat cat 1983 
Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr Gly His Thr Asp His 
650 655 660 

cgc atg aag acc gag tct eta ccc ccc ccc ccc ccg acg etc teg acg 2031 
Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro Pro Thr Leu Ser Thr 
665 670 675 

gat cat act tct tgg gga gga tat gtc tgg get gga gag ctg gga act 2 07 9 
Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala Gly Glu Leu Gly Thr 
680 685 690 
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30 



cga gtt get gtt gaa aat acc age ggc aga gga ttt ttc caa gag tac 

Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly Phe Phe Gin Glu Tyr 
695 700 705 

act cca ttt gta aaa gtc caa get gtt tac get cgc caa gat age ttt 

Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ala Arg Gin Asp Ser Phe 

710 715 720 725 

gta gaa eta gga get ate agt cgt gat ttt agt gat teg cat ctt tat 

Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser Asp Ser His Leu Tyr 
730 735 740 

aac ctt gcg att cct ctt gga ate aag tta gag aaa egg ttt gca gag 

Asn Leu Ala He Pro Leu Gly He Lys Leu Glu Lys Arg Phe Ala Glu 
745 750 755 

caa tat tat cat gtt gta gcg atg tat tct cca gat gtt tgt cgt agt 

Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro Asp Val Cys Arg Ser 
760 765 770 

aac ccc aaa tgt acg act acc eta ctt tec aac caa ggg agt tgg aag 

Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn Gin Gly Ser Trp Lys 
775 780 785 

acc aaa ggt teg aac tta gca aga cag get ggt att gtt cag gec tea 

Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly lie Val Gin Ala Ser 

790 795 800 805 

ggt ttt cga tct ttg gga get gca gca gag ctt ttc ggg aac ttt ggc 

Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu Phe Gly Asn Phe Gly 
810 815 820 

ttt gaa tgg egg gga tct tct cgt age tat aat gta gat gcg ggt age 

Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn Val Asp Ala Gly Ser 
825 830 835 

aaa ate aaa ttt 
Lys He Lys Phe 
840 



<210> 3 
<211> 1000 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 

<221> CDS 

<222> (101) . . (835) 

<400> 3 

ctcctgtccc tcgcgttgtc aacctacccc tcctccctcg aattctaatc etttgaaegt 60 

agtacaacag cctgttgctg categtcagt gccttcctac atg ccc cca ctg aat 115 

Met Pro Pro Leu Asn 



60 
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get gat gat gtt etc cct aga gac cat ctg tea gat gga agt ttc tea 163 

Ala Asp Asp Val Leu Pro Arg Asp His Leu Ser Asp Gly Ser Phe Ser 
10 15 20 

gat acg tat cca gac att aca acg caa gcg ate ate tta att ttc ttg 211 
Asp Thr Tyr Pro Asp lie Thr Thr Gin Ala lie lie Leu lie Phe Leu 
25 30 35 

gec eta teg cct ttc ctg gtc atg ttg etc act teg tat eta aag att 259 
Ala Leu Ser Pro Phe Leu Val Met Leu Leu Thr Ser Tyr Leu Lys lie 
40 45 50 

ate att act tta gtc tta tta cgt aac gec tta gga gta caa caa aca 307 
lie lie Thr Leu Val Leu Leu Arg Asn Ala Leu Gly Val Gin Gin Thr 
55 60 65 

cct ccc agt caa gtc etc aat ggg att gca etc ate eta tct att tat 355 
Pro Pro Ser Gin Val Leu Asn Gly lie Ala Leu lie Leu Ser lie Tyr 
70 75 80 85 

gtg atg ttc ccc acg gga gtg get atg tat aaa gat get cgc aag gaa 403 
Val Met Phe Pro Thr Gly Val Ala Met Tyr Lys Asp Ala Arg Lys Glu 
90 95 100 

ate gaa gee aat ace att cct caa age etc ttc act gca gaa ggt get 451 
lie Glu Ala Asn Thr lie Pro Gin Ser Leu Phe Thr Ala Glu Gly Ala 
105 110 115 

gaa aca gtg ttt gtc get tta aac aaa tct aaa gaa cct ttg cgc tct 499 
Glu Thr Val Phe Val Ala Leu Asn Lys Ser Lys Glu Pro Leu Arg Ser 
120 125 130 

ttc tta att cgc aac act cca aaa gca caa att caa age ttt tac aag 547 
Phe Leu lie Arg Asn Thr Pro Lys Ala Gin lie Gin Ser Phe Tyr Lys 
135 140 145 

ate tea cag aaa ace ttc cct teg gaa att cga gcg cac etc act gee 595 
lie Ser Gin Lys Thr Phe Pro Ser Glu lie Arg Ala His Leu Thr Ala 
150 155 160 165 

tec gac ttt gta ate att att cct get ttt att atg ggt cag ata aaa 643 
Ser Asp Phe Val lie lie He Pro Ala Phe He Met Gly Gin lie Lys 
170 175 180 

aat get ttc gaa att gga gtc ttg ate tat eta cct ttc ttt gtt att 691 
Asn Ala Phe Glu lie Gly Val Leu lie Tyr Leu Pro Phe Phe Val lie 
185 190 195 

gat tta gtg act get aac gtt ctt gta gcg atg cag atg atg atg tta 73 9 
Asp Leu Val Thr Ala Asn Val Leu Val Ala Met Gin Met Met Met Leu 
200 205 210 

tec cct eta teg att teg tta cct tta aag tta ctt ttg ate gtc atg 787 
Ser Pro Leu Ser lie Ser Leu Pro Leu Lys Leu Leu Leu He Val Met 
215 220 225 

gta gac gga tgg aca tta ctg etc caa ggg ctt atg ate age ttt aaa 83 5 
Val Asp Gly Trp Thr Leu Leu Leu Gin Gly Leu Met He Ser Phe Lys 
230 235 240 245 
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taaggacacg tgccgtgtta gcatttttcg caactagttt caaatctgtt ctttttgagt 895 
actcctacca atcattatta cttattttga ttgtttcggc acctcccatc atcttagctt 955 
ccatagtcgg gattatggtt gcgatcttcc aagccgcaac acaaa 100C 

<210> 4 

<211> 9 

<212> PRT 

<213> Chlamydia pneumoniae 
<220> 

<221> SITE 

<222> (188) . . (197) 

<400> 4 

Val Leu lie Tyr Leu Pro Phe Phe Val 



<210> 5 
<211> 900 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 
<221> CDS 

<222> (101) . . (799) 
<400> 5 

tagctttata caaagtatag aaaaataaca cgacaataaa aggagcggtg ttttctcttc 60 

tgaggtaaat cagcctcaaa gatactacgc catagtaaag atg aag ttt ttt age 115 

Met Lys Phe Phe Ser 



tta att ttt aaa gat gat gat gtc tec cca aat aag aag gtt tta tct 
Leu He Phe Lys Asp Asp Asp Val Ser Pro Asn Lys Lys Val Leu Ser 



cct gaa get ttc tct get ttc ctt gat gee aaa gag ctg tta gaa aaa 
Pro Glu Ala Phe Ser Ala Phe Leu Asp Ala Lys Glu Leu Leu Glu Lys 



aca aaa gec gat age gaa gee tat gtt gca gag aca gaa caa aag tgt 
Thr Lys Ala Asp Ser Glu Ala Tyr Val Ala Glu Thr Glu Gin Lys Cys 



gca caa att cgt caa gaa get aaa gat caa gga ttt aaa gag gga tct 
Ala Gin He Arg Gin Glu Ala Lys Asp Gin Gly Phe Lys Glu Gly Ser 



gaa tec tgg age aag caa att get ttc tta gaa gaa gaa act aaa aat 
Glu Ser Trp Ser Lys Gin lie Ala Phe Leu Glu Glu Glu Thr Lys Asn 



eta cgc ata aga gta cgc gag gec ttg gtt cct ctg gca att gcg agt 
Leu Arg He Arg Val Arg Glu Ala Leu Val Pro Leu Ala lie Ala Ser 
60 90 95 100 



403 



O *3 O 3 O H-H- itS O S ?? 'B 'O i" 



12/83 



gtg agg aaa ate att ggg aag gaa etc gaa tta cat cct gaa act att 451 

Val Arg Lys lie lie Gly Lys Glu Leu Glu Leu His Pro Glu Thr lie 

105 110 115 

gtc tct att att tct caa gca ttg aaa gag etc aca caa aat aaa cat 499 

Val Ser lie lie Ser Gin Ala Leu Lys Glu Leu Thr Gin Asn Lys His 

120 125 130 

ate att ate tct gtc aat ccc aaa gat tta cct ctt gtt gag aaa agt 547 

lie lie He Ser Val Asn Pro Lys Asp Leu Pro Leu Val Glu Lys Ser 

135 140 145 

cgt cct gaa etc aag aac ate gtg gag tat get gac tec tta att ctt 595 

Arg Pro Glu Leu Lys Asn He Val Glu Tyr Ala Asp Ser Leu lie Leu 

150 155 160 165 

aca gca aaa cct gat gtt act cct ggg ggt tgc att ate gag act gaa 643 

Thr Ala Lys Pro Asp Val Thr Pro Gly Gly Cys He He Glu Thr Glu 
170 175 180 

gca ggg ate ate aat gcg cag ctt gat gta caa tta gat gee tta gaa 691 

Ala Gly He He Asn Ala Gin Leu Asp Val Gin Leu Asp Ala Leu Glu 

185 190 195 

aaa get ttc teg act ata eta aaa gcg aag aac cct gta gac gag cca 73 9 

Lys Ala Phe Ser Thr He Leu Lys Ala Lys Asn Pro Val Asp Glu Pro 

200 205 210 

tct gag act tea tea tec acg gat tct tct tct tta tct aat gat cag 787 

Ser Glu Thr Ser Ser Ser Thr Asp Ser Ser Ser Leu Ser Asn Asp Gin 

215 220 225 

gat aag aaa gaa taaaggtatt cactattatg cgatccattt ttcgattttc 839 

Asp Lys Lys Glu 

230 

cctttgtttt tttacgctga gcgtctcatg ctgatttget gacgccagtc tatatgaaaa 899 



<210> 6 

<211> 9 

<212> PRT 

<213> Chlamydia pne 
<220> 

<221> SITE 

<222> (163) . . (171) 



<400> 6 

Leu He Leu Thr Ala Lys Pro Asp Val 



<210> 7 
<211> 1200 
<212> DNA 
6 0 <213> Chlamydia pneumoniae 
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<220> 
<221> CDS 

<222> (101) . . (1066) 
<400> 7 

tgttcgcgat tggcactaat cccccctttt gttatggtga ataaaaaggt atgcgtggat 60 

tatggttcgt cgatctattt ctttttgctt gttctttcta atg aca ttg ctg tgc 115 

Met Thr Leu Leu Cys 
1 5 

tgt aca age tgt aac age agg tct eta att gtg cac ggt ctt cct ggc 163 
Cys Thr Ser Cys Asn Ser Arg Ser Leu lie Val His Gly Leu Pro Gly 



aga gaa gcg aat gag att gtg gtg ctt ttg gta age aaa ggg gtg get 
Arg Glu Ala Asn Glu lie Val Val Leu Leu Val Ser Lys Gly Val Ala 



gca caa aaa ttg cct caa get gca gcg get aca gec gga gca get act 
Ala Gin Lys Leu Pro Gin Ala Ala Ala Ala Thr Ala Gly Ala Ala Thr 



gag caa atg tgg gat ate gcg gtt ccg tea gca caa ate aca gag gec 
Glu Gin Met Trp Asp lie Ala Val Pro Ser Ala Gin lie Thr Glu Ala 



ctt gec att eta aat caa gcg ggt ctt cca cgt atg aaa ggg aca age 
Leu Ala lie Leu Asn Gin Ala Gly Leu Pro Arg Met Lys Gly Thr Ser 



ctg tta gat ctt ttt gca aaa caa ggt ctt gtt cct tec gag ctt cag 

Leu Leu Asp Leu Phe Ala Lys Gin Gly Leu Val Pro Ser Glu Leu Gin 

90 95 100 

gaa aaa ate cgt tat caa gaa ggc tta tea gaa cag atg gec tct acg 

Glu Lys lie Arg Tyr Gin Glu Gly Leu Ser Glu Gin Met Ala Ser Thr 

105 110 115 

att aga aaa atg gat ggc gtt gtc gat gee tea gta cag att tec ttc 

lie Arg Lys Met Asp Gly Val Val Asp Ala Ser Val Gin lie Ser Phe 

120 125 130 

act aca gaa aat gaa gat aat ctt cct tta aca gec tct gtg tat att 

Thr Thr Glu Asn Glu Asp Asn Leu Pro Leu Thr Ala Ser Val Tyr lie 

135 140 145 

aag cat cga ggg gtt ttg gac aat ccg aac age att atg gtt tec aaa 

Lys His Arg Gly Val Leu Asp Asn Pro Asn Ser lie Met Val Ser Lys 

150 155 160 165 

att aag cgc ctt att gca agt get gtt cca gga ctt gtg cca gag aac 

lie Lys Arg Leu lie Ala Ser Ala Val Pro Gly Leu Val Pro Glu Asn 

170 175 180 



gtc tct gta gtg age gat cgc gca get tat agt gat att aca att aat 
Val Ser Val Val Ser Asp Arg Ala Ala Tyr Ser Asp lie Thr lie Asn 
60 185 190 195 



691 
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ggt cct tgg gga tta aca gaa gaa ate gat tat gtt tct gtt tgg ggt 73 9 
Gly Pro Trp Gly Leu Thr Glu Glu lie Asp Tyr Val Ser Val Trp Gly 
200 205 210 

att att ctt gcg aag tct teg etc acc aaa ttc cgt etc att ttt tat 787 

lie lie Leu Ala Lys Ser Ser Leu Thr Lys Phe Arg Leu lie Phe Tyr 

215 220 225 

10 

gtc ttg att etc att tta ttt gtt att tct tgt ggt etc ctt tgg gtc 835 

Val Leu lie Leu lie Leu Phe Val lie Ser Cys Gly Leu Leu Trp Val 

230 235 240 245 

att tgg aaa act cat act etc att atg act atg gga ggt aca aaa ggg 883 
He Trp Lys Thr His Thr Leu He Met Thr Met Gly Gly Thr Lys Gly 
250 255 260 

ttc ttc aac cct aca cca tat aca aag aat gec ttg gaa gec aag aaa 931 
2 0 Phe Phe Asn Pro Thr Pro Tyr Thr Lys Asn Ala Leu Glu Ala Lys Lys 
265 270 275 

gec gag gga gca get get gac aaa gag aaa aaa gaa gat gca gat tea 97 9 
Ala Glu Gly Ala Ala Ala Asp Lys Glu Lys Lys Glu Asp Ala Asp Ser 
280 285 290 

cag ggg gaa age aaa aat gcg gaa acc agt gat aaa gac tct agt gat 1027 
Gin Gly Glu Ser Lys Asn Ala Glu Thr Ser Asp Lys Asp Ser Ser Asp 
295 300 305 

30 

aaa gat get cca gaa gga age aat gaa att gag ggt get tagtgactgc 1076 
Lys Asp Ala Pro Glu Gly Ser Asn Glu lie Glu Gly Ala 
310 315 320 

caacactttt ggaactctag acatcttgat gaagcactcc aaggaagatg acctctccag 1136 
gtttcttcct aaaaatcttc ttgttgaatc tcctcatccc gaagaaatcc ctttaaaatc 1196 
ttta 1200 



40 <210> 8 
<211> 9 
<212> PRT 

<213> Chlamydia pneumoniae 
<220> 

<221> SITE 

<222> (201) . . (209) 

<400> 8 

5 0 Gly Leu Thr Glu Glu lie Asp Tyr Val 



<210> 9 
<211> 850 
<212> DNA 

<213> Chlamydia pneumoniae 



<220> 
60 <221> CDS 

<222> (101) . . (778) 
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<400> 9 

ttgaacccta tggaaatgta tcttatttgt gctgggctat atttcttaat gacaacatca 60 

ttttcctgta tttctaggtt atcagaaaag agaaggagtt atg aca att aga gtc 115 

Met Thr He Arg Val 
1 5 

10 cga aac ctt gcc tac tct gta aat aag aaa aag att eta gat ggt gta 163 
Arg Asn Leu Ala Tyr Ser Val Asn Lys Lys Lys He Leu Asp Gly Val 
10 15 20 

act ttt tct tta gag cga ggg cac att aca ctg ttt gtt ggg aag agt 211 
Thr Phe Ser Leu Glu Arg Gly His He Thr. Leu Phe Val Gly Lys Ser 
25 30 35 

ggt tea gga aaa aca atg att tta cgt get ttg gcg ggc tta gtc cag 259 
Gly Ser Gly Lys Thr Met He Leu Arg Ala Leu Ala Gly Leu Val Gin 
20 40 45 50 

ccc act caa gga gat att tgg att gaa ggg gag get cca get eta gtt 3 07 
Pro Thr Gin Gly Asp He Trp lie Glu Gly Glu Ala Pro Ala Leu Val 
55 60 65 

ttc caa caa ccc gag tta ttt tec cat atg aca gta tta gga aat tgc 355 
Phe Gin Gin Pro Glu Leu Phe Ser His Met Thr Val Leu Gly Asn Cys 
70 75 80 85 

3 0 acc cat cca caa ate cat ate aag ggt cgt agt ace gaa gaa get cga 403 
Thr His Pro Gin He His He Lys Gly Arg Ser Thr Glu Glu Ala Arg 
90 95 100 

gaa aag gcg ttc gag ctt tta cat ttg ttg gat att gaa gag gtt get 451 
Glu Lys Ala Phe Glu Leu Leu His Leu Leu Asp He Glu Glu Val Ala 
105 110 115 

aag aat tat cct gac cag etc tct ggg gga caa aaa caa cgt gtg get 4 99 
Lys Asn Tyr Pro Asp Gin Leu Ser Gly Gly Gin Lys Gin Arg Val Ala 
40 120 125 130 

att gta cgt tct tta tgt atg gat aaa cat aca tta ctt ttt gat gaa 547 
He Val Arg Ser Leu Cys Met Asp Lys His Thr Leu Leu Phe Asp Glu 
135 140 145 

cct aca teg get tta gat cct ttt get acg gca teg ttc cga cat ctt 595 
Pro Thr Ser Ala Leu Asp Pro Phe Ala Thr Ala Ser Phe Arg His Leu 
150 155 160 165 

5 0 tta gaa aca ctt cga gac cag gaa ctg act gta ggg tta act act cat 643 
Leu Glu Thr Leu Arg Asp Gin Glu Leu Thr Val Gly Leu Thr Thr His 
170 175 180 

gac atg caa ttt gtt cat agt tgt ttg gat cgt ate tat ctt ata gat 691 
Asp Met Gin Phe Val His Ser Cys Leu Asp Arg He Tyr Leu He Asp 
185 190 195 



caa gga act gtt gcg ggg gtc tat gac aag cgt gac gga gag etc gat 
Gin Gly Thr Val Ala Gly Val Tyr Asp Lys Arg Asp Gly Glu Leu Asp 
60 200 205 210 



739 
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tct ggt cat cca tta teg aaa tat ate cac tct get caa taggactaca 788 
Ser Gly His Pro Leu Ser Lys Tyr lie His Ser Ala Gin 
215 220 225 

getgetagag cagctgtagt gatactttag aatcctgacc agtggcagga atgageggea 84 8 
tg 850 

<210> 10 
<211> 9 
<212> PRT 

<213> Chlamydia pneumoniae 
<220> 

<221> SITE 

<222> (145) . . (153) 

<400> 10 

Leu Leu Phe Asp Glu Pro Thr Ser Ala 



<210> 11 

<211> 1500 

<212> DNA 

<213> Chlamydia pneumoniae 

<220> 
<221> CDS 

<222> (101) . . (1408) 
<400> 11 

ctctgattta tggtaattct ttattttcag ageegtcaag tcctttctat tctgttgaat 60 

ttcctaataa cgtaagtaat aaacaatcaa aagtcegcat atg aaa aga cct ttt 115 

Met Lys Arg Pro Phe 

1 5 

ttt acc tat eta tgc ate ate ttc tac gga tct tgt gca teg tta tct 163 

Phe Thr Tyr Leu Cys lie lie Phe Tyr Gly Ser Cys Ala Ser Leu Ser 

10 15 20 

tta cat gca gga etc tct ttc cca gaa gta cgt gga get acg get get 211 
Leu His Ala Gly Leu Ser Phe Pro Glu Val Arg Gly Ala Thr Ala Ala 
25 30 35 

gtt gtc cat gee gac tct ggg aag gta ttc tat gat aaa gac ata gat 259 
Val Val His Ala Asp Ser Gly Lys Val Phe Tyr Asp Lys Asp lie Asp 
40 45 50 

get gta ate tat cct gec age atg acg aaa ate gca act gec etc ttt 3 07 
Ala Val lie Tyr Pro Ala Ser Met Thr Lys lie Ala Thr Ala Leu Phe 



60 



ate eta aag cac tat ccc aca gtc etc gat act etc ate aaa gtc aaa 
lie Leu Lys His Tyr Pro Thr Val Leu Asp Thr Leu lie Lys Val Lys 
70 75 80 85 



355 
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caa gat gcg ate get tec ate act ccg caa gca aaa aaa caa tea gga 
Gin Asp Ala lie Ala Ser lie Thr Pro Gin Ala Lys Lys Gin Ser Gly 
90 95 100 

tat cgt agt cct ccc cac tgg tta gaa act gat gga tct aca ata cag 
Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp Gly Ser Thr He Gin 
105 110 115 

etc cat ctt cga gaa gag ctt tta ggg tgg gac ctg ttc cac gee tta 
Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp Leu Phe His Ala Leu 
120 125 130 

ctg gtc tgt tct get aat gat get gcg aat gtc tta get atg gca tgt 
Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val Leu Ala Met Ala Cys 
135 140 145 

tgc gga tct gta gag aag ttt atg gat aag ctg aac ttc ttc tta aaa 
Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu Asn Phe Phe Leu Lys 
150 155 160 165 

gaa gaa ate ggc tgc act cat acc cat ttt aat aat ccc cat ggg tta 
Glu Glu He Gly Cys Thr His Thr His Phe Asn Asn Pro His Gly Leu 
170 175 180 

cat cat ccg aat cac tat act aca acc cgt gat ctt att age ate atg 
His His Pro Asn His Tyr Thr Thr Thr Arg Asp Leu lie Ser He Met 
185 190 195 

cgt tgc get ctg aaa gaa cct cca ttt cga ggg gtc ate tec acg aca 
Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly Val lie Ser Thr Thr 
200 205 210 

age tat aaa ata ggg get aca aac ctg cat ggc gaa egg ate eta tec 
Ser Tyr Lys He Gly Ala Thr Asn Leu His Gly Glu Arg He Leu Ser 
215 220 225 

cca aca aac aaa ttg ctt ctt cct ggg tct acc tac cac tat ccc cca 
Pro Thr Asn Lys Leu Leu Leu Pro Gly Ser Thr Tyr His Tyr Pro Pro 
230 235 240 245 

get tta gga ggg aaa aca ggg acc acc aag act gca ggg aaa aat eta 
Ala Leu Gly Gly Lys Thr Gly Thr Thr Lys Thr Ala Gly Lys Asn Leu 
250 255 260 

att atg get get gaa aaa aat aac cgc etc ttg gta acg ate gca acg 
He Met Ala Ala Glu Lys Asn Asn Arg Leu Leu Val Thr He Ala Thr 
265 270 275 

ggc tat teg ggt cct gtg agt gat etc tac caa gat gtc att get eta 
Gly Tyr Ser Gly Pro Val Ser Asp Leu Tyr Gin Asp Val He Ala Leu 
280 285 290 

tgt gaa acg gta ttt aac gag ccg eta tta aga aaa gag etc gtc ccc 
Cys Glu Thr Val Phe Asn Glu Pro Leu Leu Arg Lys Glu Leu Val Pro 
295 300 305 

ccc tec gac tgt etc caa tta gaa ata gcg aat ctt ggg aag ctt tct 
Pro Ser Asp Cys Leu Gin Leu Glu He Ala Asn Leu Gly Lys Leu Ser 
310 315 320 325 
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tgc cct ctt cct gag gga etc tac tat gac ttc tat gec tec gaa gat 
Cys Pro Leu Pro Glu Gly Leu Tyr Tyr Asp Phe Tyr Ala Ser Glu Asp 
330 335 340 



cgc gaa cct ctt tct gta tct ttt att gca cat gcg gac gec ttc cct 

Arg Glu Pro Leu Ser Val Ser Phe lie Ala His Ala Asp Ala Phe Pro 

345 350 355 

10 

att gaa caa gga gat ctt ctt ggt cat tgg gtt ttt tat gac gat gaa 

lie Glu Gin Gly Asp Leu Leu Gly His Trp Val Phe Tyr Asp Asp Glu 

360 365 370 

ggc aag aaa att tct tec cag cct ttc tat gec cct tgt cgt ttt gag 

Gly Lys Lys lie Ser Ser Gin Pro Phe Tyr Ala Pro Cys Arg Phe Glu 
375 380 385 

cgc act ate aag cct tgg aaa etc tat atg aaa cgt gtc ttc aca teg 

2 0 Arg Thr lie Lys Pro Trp Lys Leu Tyr Met Lys Arg Val Phe Thr Ser 
390 395 400 405 

tat aga acc tat atg tct ata ace atg ctg etc atg tat ttt cgc ate 

Tyr Arg Thr Tyr Met Ser lie Thr Met Leu Leu Met Tyr Phe Arg lie 
410 415 420 



cgc aag cac cgc aag tat aaa aat tta aaa cac tat tct aaa ate 140 8 

Arg Lys His Arg Lys Tyr Lys Asn Leu Lys His Tyr Ser Lys lie 
425 430 435 

30 

taactttttc ttttaattta taaaaaacca aaggtttatg taagatttgc gcttttcaat 1468 
ccaacaagaa tcccttgtgc gcacattact tt 1500 



<210> 12 

<211> 9 

<212> PRT 

<213> Chlamydia pneumoniae 



40 <220> 

<221> SITE 

<222> (156) . . (164) 



<400> 12 

Phe Met Asp Lys Leu Asn Phe Phe Leu 
1 5 



<210> 13 

<211> 2100 

<212> DNA 

<213> Chlamydia pneumoniae 



<220> 
<221> CDS 

<222> (101) . . (2035) 



<400> 13 

aaggagcaaa tggagattgg ccaaatagac gagcaagggt ttgeataaga atagectttt 60 

60 
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tcgcaataat aacttgccta aacgatcttg taaacgactt atg get tct aat ccc 

Met Ala Ser Asn Pro 



att tta cag ata gag gat eta tec ata ace ttg gca aaa caa cgc caa 
He Leu Gin He Glu Asp Leu Ser lie Thr Leu Ala Lys Gin Arg Gin 



cag tac ccc ate gtc caa tct tta teg ttt act ate aat gaa gga caa 
Gin Tyr Pro He Val Gin Ser Leu Ser Phe Thr lie Asn Glu Gly Gin 



acc tta gca ate att gga gaa tea gga tea gga aaa tct gtc tct gcg 
Thr Leu Ala lie lie Gly Glu Ser Gly Ser Gly Lys Ser Val Ser Ala 



cat gca ate ctt cga tta ctt cct tgc ccc cca ttt tct gtt tct ggc 
His Ala lie Leu Arg Leu Leu Pro Cys Pro Pro Phe Ser Val Ser Gly 



cag gtc aac ttc caa ggc cac aac tta ctt acg get teg cgc tct ata 
Gin Val Asn Phe Gin Gly His Asn Leu Leu Thr Ala Ser Arg Ser lie 



caa aaa aag att ata ggg aca gaa att tct atg ate ttt caa aac ccg 

Gin Lys Lys He He Gly Thr Glu He Ser Met He Phe Gin Asn Pro 

90 95 100 

caa gca tct eta aac ccc gtg ttt act att gaa cag cag ttt cga gaa 
Gin Ala Ser Leu Asn Pro Val Phe Thr He Glu Gin Gin Phe Arg Glu 

105 110 115 

att att cat acc cac eta gee tta act gca gaa gtt get aaa gaa aag 

He lie His Thr His Leu Ala Leu Thr Ala Glu Val Ala Lys Glu Lys 

120 125 130 

atg tta tac get ctt gaa gaa aca ggg ttt cat gat ccc agg ctg tgc 

Met Leu Tyr Ala Leu Glu Glu Thr Gly Phe His Asp Pro Arg Leu Cys 
135 140 145 

ttg aat etc tac ccc cac caa etc tct gga ggg atg ctt caa aga att 

Leu Asn Leu Tyr Pro His Gin Leu Ser Gly Gly Met Leu Gin Arg He 

150 155 160 165 

tgc att gec atg gcg etc etc tgt tct cct aaa ctt ctt att get gat 

Cys He Ala Met Ala Leu Leu Cys Ser Pro Lys Leu Leu He Ala Asp 

170 175 180 

gaa cct acg act get tta gat gtt tct gtt cag tat cag att eta caa 

Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin Tyr Gin He Leu Gin 

185 190 195 

tta eta aaa aca eta cag aaa aaa acg gga atg age ctt ctt att att 

Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Met Ser Leu Leu He He 

200 205 210 

acc cat aat atg gga gtc gtt gca gaa act get gat gac gtg etc gtg 

Thr His Asn Met Gly Val Val Ala Glu Thr Ala Asp Asp Val Leu Val 
215 220 225 
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etc tat gca gga cgc atg gta gaa tgt gec cct gcg gtt caa atg ttc 

Leu Tyr Ala Gly Arg Met Val Glu Cys Ala Pro Ala Val Gin Met Phe 
230 235 240 245 

cat aat cct tct cat ccc tat acc cga gat ctt tta gca tec aga ccc 

His Asn Pro Ser His Pro Tyr Thr Arg Asp Leu Leu Ala Ser Arg Pro 
250 255 260 

tct eta caa ccg caa caa eta ggt tec ttc aac ccc att cca gga cag 

Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn Pro lie Pro Gly Gin 

265 270 275 

ccc cca cac tac acg gee ttt ccc teg gga tgt cgc tat cac cct aga 

Pro Pro His Tyr Thr Ala Phe Pro Ser Gly Cys Arg Tyr His Pro Arg 

280 285 290 

tgc tea aaa att tta aat cga tgt tct gcg gaa get cca gaa ate tat 

Cys Ser Lys lie Leu Asn Arg Cys Ser Ala Glu Ala Pro Glu lie Tyr 
295 300 305 

ccg gta cgc gaa ggt cac aaa gta agg gtt ggc tgt atg acg act aat 

Pro Val Arg Glu Gly His Lys Val Arg Val Gly Cys Met Thr Thr Asn 
310 315 320 325 

ttt ccc caa cct tta att caa gca acc tea tta aca aag cac tat tac 

Phe Pro Gin Pro Leu lie Gin Ala Thr Ser Leu Thr Lys His Tyr Tyr 
330 335 340 

aag cgt tec ttt tgg ttt cag gga aag aca att gee agt cgt cct gtt 

Lys Arg Ser Phe Trp Phe Gin Gly Lys Thr lie Ala Ser Arg Pro Val 

345 350 355 

gac gac gtc tct ttt tea eta tac tec aga cgt get gtc gga ctt att 

Asp Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg Ala Val Gly Leu lie 

360 365 370 

gga gaa tct gga tea ggg aaa agt acc ctg gcg tta get etc gca ggt 

Gly Glu Ser Gly Ser Gly Lys Ser Thr Leu Ala Leu Ala Leu Ala Gly 
375 380 385 

etc eta cct etc acc tct ggg ttc tta act ttt aac ggc acc cca ate 

Leu Leu Pro Leu Thr Ser Gly Phe Leu Thr Phe Asn Gly Thr Pro lie 
390 395 400 405 

aag ttg cat tct aaa cac gga cgc cat caa tta cga tct caa gta egg 

Lys Leu His Ser Lys His Gly Arg His Gin Leu Arg Ser Gin Val Arg 
410 415 420 

ttg gtc ttt caa aat cca caa get tea tta aac ccg cga aaa act ate 

Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Arg Lys Thr lie 

425 430 435 

eta gat agt tta ggc cac tct ctg ctt tac cat aaa etc gtc cca aaa 

Leu Asp Ser Leu Gly His Ser Leu Leu Tyr His Lys Leu Val Pro Lys 

440 445 450 

gaa aaa gta eta gca acg gta agg gaa tat tta gaa ttg gta ggg tta 

Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu Glu Leu Val Gly Leu 
455 460 465 
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tct gag gag tat ttt tat cgt tat cct cac cag ctt tct gga gga caa 1555 

Ser Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin Leu Ser Gly Gly Gin 

470 475 480 485 

caa caa cga gtc tct ata gcg aga gcc eta tta gga gtc cct cag tta 1603 

Gin Gin Arg Val Ser lie Ala Arg Ala Leu Leu Gly Val Pro Gin Leu 
490 495 500 

att att tgt gac gaa att gtt tct get eta gat tta tct att caa gca 1651 

lie lie Cys Asp Glu lie Val Ser Ala Leu Asp Leu Ser lie Gin Ala 
505 510 515 

caa att ctg aat atg ctt gcc gag ctg caa aaa aaa etc age etc aca 1699 

Gin lie Leu Asn Met Leu Ala Glu Leu Gin Lys Lys Leu Ser Leu Thr 

520 525 530 

tat etc ttc att teg cat gat ctt gcc gtt gta cgc teg ttc tgc aca 1747 

Tyr Leu Phe lie Ser His Asp Leu Ala Val Val Arg Ser Phe Cys Thr 

535 540 545 

gag gta ttc att atg tat aag ggg caa att gta gaa aaa gga aat aca 1795 

Glu Val Phe lie Met Tyr Lys Gly Gin lie Val Glu Lys Gly Asn Thr 

550 555 560 565 

aaa cgc att ttt tct gat cca caa cat cct tat acg cgc atg ttg tta 1843 

Lys Arg lie Phe Ser Asp Pro Gin His Pro Tyr Thr Arg Met Leu Leu 
570 575 580 

aat gcc caa ctt cca gag act cct gat caa agg caa tct aaa cct ata 1891 

Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg Gin Ser Lys Pro lie 
585 590 595 

ttc caa gaa tat cac aaa gat tct gaa gaa tct tgc tct aca gga tgc 193 9 

Phe Gin Glu Tyr His Lys Asp Ser Glu Glu Ser Cys Ser Thr Gly Cys 

600 605 610 

tac ttt tac aat cgt tgt cca caa aaa caa gaa get tgc aag tea gag 1987 

Tyr Phe Tyr Asn Arg Cys Pro Gin Lys Gin Glu Ala Cys Lys Ser Glu 

615 620 625 

ate ate cca aat caa gga gac gcg cac cat aca tac cgt tgt ate cat 2035 

lie lie Pro Asn Gin Gly Asp Ala His His Thr Tyr Arg Cys lie His 

630 635 640 645 

tgattegtec tetaegctat tcttaagcta ccattaagga atcccaaggg agaggtctgc 2095 
tctat 2100 



<210> 14 
<211> 9 
<212> PRT 

<213> Chlamydia pneumoniae 



<220> 

<221> SITE 
<222> (59) . . (67) 
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<400> 14 

Leu Leu Pro Cys Pro Pro Phe Ser Val 
1 5 



<210> 15 
<211> 2300 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 
<221> CDS 

<222> (101) . . (2194) 
<400> 15 

cgaagagcaa acctccacag ttacagagaa agacgtccaa cctaaaacac aagcaacacc 60 

acacgcttcg aagaaaaacg ttgcaagtcc ttcgacctct atg cca gga ate gag 115 

Met Pro Gly lie Glu 



aaa gca gca aca aca gtg get gta cct caa gac aaa tct gaa gaa gaa 
Lys Ala Ala Thr Thr Val Ala Val Pro Gin Asp Lys Ser Glu Glu Glu 



aaa gtt aaa gag cga ttg aca aag egg gaa ctt acc tgt gaa gac ctt 
Lys Val Lys Glu Arg Leu Thr Lys Arg Glu Leu Thr Cys Glu Asp Leu 



aaa gat aac ggc tat act gtc aat ttt gaa gac att tct att tta gag 

Lys Asp Asn Gly Tyr Thr Val Asn Phe Glu Asp lie Ser lie Leu Glu 

40 45 50 

ttg ttg cag ttc gta agt aaa att tct gga acg aac ttt gtc ttt gat 

Leu Leu Gin Phe Val Ser Lys lie Ser Gly Thr Asn Phe Val Phe Asp 



age aac gat ttg caa ttc aat gtc acg ate gtt tec cac gat cct act 
Ser Asn Asp Leu Gin Phe Asn Val Thr lie Val Ser His Asp Pro Thr 



tct gta gat gat tta tct aca ate tta eta caa gtc tta aaa atg cat 

Ser Val Asp Asp Leu Ser Thr lie Leu Leu Gin Val Leu Lys Met His 
90 95 100 

gac ttg aag gtt gtt gaa caa ggc aat aac gtc ctt ate tat cgt aat 

Asp Leu Lys Val Val Glu Gin Gly Asn Asn Val Leu lie Tyr Arg Asn 

50 105 110 115 

cct cat ctt tct aag eta tec aca gta gtc aca gac age tec tta aaa 

Pro His Leu Ser Lys Leu Ser Thr Val Val Thr Asp Ser Ser Leu Lys 

120 125 130 

gaa acg tgt gaa get gtt gtg gtt acc cga gtg ttc cgt ctt tac agg 

Glu Thr Cys Glu Ala Val Val Val Thr Arg Val Phe Arg Leu Tyr Arg 

135 140 145 



60 



0 1C 3H 8 3 €1 'NPjHS ,, O 5 E? !E 



23/83 



cgt cag ccc tct gca gca gta aat att att caa cct tta ctt tec cat 595 
Arg Gin Pro Ser Ala Ala Val Asn lie lie Gin Pro Leu Leu Ser His 
150 155 160 165 

gat get ate gtt agt get tea gaa get act cgt cat gtt ate ate teg 643 
Asp Ala lie Val Ser Ala Ser Glu Ala Thr Arg His Val lie lie Ser 
170 175 180 

gat att get ggt aat gtc gat aaa gtc agt gat ttg eta gca get eta 691 
Asp lie Ala Gly Asn Val Asp Lys Val Ser Asp Leu Leu Ala Ala Leu 
185 190 195 

gat tgc cca ggc aca tct gtg gac atg act gaa tac gaa gtt aaa tat 73 9 
Asp Cys Pro Gly Thr Ser Val Asp Met Thr Glu Tyr Glu Val Lys Tyr 
200 205 210 

gec aat ccc gca get ctt gtt age tac tgc caa gat gtt ctt ggt act 787 
Ala Asn Pro Ala Ala Leu Val Ser Tyr Cys Gin Asp Val Leu Gly Thr 
215 220 225 

ctg gee gaa gat gat get ttc caa atg ttc ate caa cct gga acg aac 835 
Leu Ala Glu Asp Asp Ala Phe Gin Met Phe lie Gin Pro Gly Thr Asn 
230 235 240 245 

aaa att ttc gtc gtc tct tea cca cgt ctt gca aat aag gca gag cag 883 
Lys lie Phe Val Val Ser Ser Pro Arg Leu Ala Asn Lys Ala Glu Gin 
250 255 260 

etc ctg aag tec tta gat gtc cca gaa atg gca cat acc eta gat gat 931 
Leu Leu Lys Ser Leu Asp Val Pro Glu Met Ala His Thr Leu Asp Asp 
265 270 275 

cct gca agt act gee ttg get ttg gga gga aca gga acc acg age cct 97 9 
Pro Ala Ser Thr Ala Leu Ala Leu Gly Gly Thr Gly Thr Thr Ser Pro 
280 285 290 

aag agt ttg egg ttc ttt atg tac aag ctg aag tat caa aat gga gaa 1027 
Lys Ser Leu Arg Phe Phe Met Tyr Lys Leu Lys Tyr Gin Asn Gly Glu 
295 300 305 

gtg att get aat gec etc caa gat ate ggt tac aat eta tat gta acc 1075 
Val lie Ala Asn Ala Leu Gin Asp lie Gly Tyr Asn Leu Tyr Val Thr 
310 315 320 325 

aca get atg gac gaa gat ttc att aac act etc aat agt ate cag tgg 1123 
Thr Ala Met Asp Glu Asp Phe lie Asn Thr Leu Asn Ser lie Gin Trp 
330 335 340 

tta gag gtc aat aac tec ata gtt att ate gga aac caa ggg aat gtc 1171 
Leu Glu Val Asn Asn Ser lie Val lie lie Gly Asn Gin Gly Asn Val 
345 350 355 

gac aga gtt att ggc etc tta aac ggt tta gat tta cct cct aaa cag 1219 
Asp Arg Val lie Gly Leu Leu Asn Gly Leu Asp Leu Pro Pro Lys Gin 
360 365 370 

gtt tac ate gaa gtt tta att eta gat acc age tta gag aaa tec tgg 1267 
Val Tyr lie Glu Val Leu lie Leu Asp Thr Ser Leu Glu Lys Ser Trp 
375 380 385 
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gac ttt gga gtg caa tgg gta gcc eta ggt gat gaa caa agt aaa gta 
Asp Phe Gly Val Gin Trp Val Ala Leu Gly Asp Glu Gin Ser Lys Val 
390 395 400 405 

get tat get tct gga eta ttg aat aat act ggc ata gcc aca cct aca 
Ala Tyr Ala Ser Gly Leu Leu Asn Asn Thr Gly lie Ala Thr Pro Thr 
410 415 420 

aaa gca act gtc cct ccc ggc acg cca aat cct ggt teg ate cct ctt 
Lys Ala Thr Val Pro Pro Gly Thr Pro Asn Pro Gly Ser lie Pro Leu 
425 430 435 

cct acg cca gga caa ttg aca ggg ttc tea gat atg ctg aac tct teg 
Pro Thr Pro Gly Gin Leu Thr Gly Phe Ser Asp Met Leu Asn Ser Ser 
440 445 450 

tea gca ttc ggt eta gga ate ate gga aat gtc eta agt cat aaa ggg 
Ser Ala Phe Gly Leu Gly lie lie Gly Asn Val Leu Ser His Lys Gly 
455 460 465 

aag tct ttc ctt act ttg gga ggc tta tta agt gcc tta gat caa gat 
Lys Ser Phe Leu Thr Leu Gly Gly Leu Leu Ser Ala Leu Asp Gin Asp 
470 475 480 485 

gga gat act gtc att gtc ttg aat cct aga ate atg get cag gat acg 
Gly Asp Thr Val lie Val Leu Asn Pro Arg lie Met Ala Gin Asp Thr 
490 495 500 

caa caa get teg ttt ttt gta ggg caa acg gtc cct tac caa act ate 
Gin Gin Ala Ser Phe Phe Val Gly Gin Thr Val Pro Tyr Gin Thr lie 
505 510 515 

aaa tac tat ate caa gaa aca gga act gta acg caa aat ate gat tat 
Lys Tyr Tyr lie Gin Glu Thr Gly Thr Val Thr Gin Asn lie Asp Tyr 
520 525 530 

gaa gat att gga gtg aac ctt gtc gtt ace tct aca gtt get ccc aac 
Glu Asp lie Gly Val Asn Leu Val Val Thr Ser Thr Val Ala Pro Asn 
535 540 545 

aat gta gtt aca eta caa ate gaa cag acg ate tea gaa tta cat tec 
Asn Val Val Thr Leu Gin lie Glu Gin Thr lie Ser Glu Leu His Ser 
550 555 560 565 

gcg tct gga tea eta aca cct gtc aca gat aaa act tat gca gcc aca 
Ala Ser Gly Ser Leu Thr Pro Val Thr Asp Lys Thr Tyr Ala Ala Thr 
570 575 580 

cgc tta caa att ccc gac ggt tgt ttc tta gtt atg agt ggg cat ate 
Arg Leu Gin lie Pro Asp Gly Cys Phe Leu Val Met Ser Gly His lie 
585 590 595 

aga gat aaa act aca aaa gtg gtt tea gga gtg cct ttg eta aac tec 
Arg Asp Lys Thr Thr Lys Val Val Ser Gly Val Pro Leu Leu Asn Ser 
600 605 610 

ata cca tta att cgt ggt tta ttt age cgt ace ate gac caa agg caa 
lie Pro Leu lie Arg Gly Leu Phe Ser Arg Thr lie Asp Gin Arg Gin 
615 620 625 
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aaa cgc aat ate atg atg ttt att aag cct aag gtg att agt age ttt 
Lys Arg Asn lie Met Met Phe He Lys Pro Lys Val He Ser Ser Phe 
630 635 640 645 



gaa gaa ggc act cgt gtt acc aat aag gaa gga tac aga tac aat tgg 

Glu Glu Gly Thr Arg Val Thr Asn Lys Glu Gly Tyr Arg Tyr Asn Trp 

650 655 660 

gaa get gat gaa gga tec atg caa gtg gee cct cgc cat get cct gaa 

Glu Ala Asp Glu Gly Ser Met Gin Val Ala Pro Arg His Ala Pro Glu 
665 670 675 



tgc caa gga cct cct tct tta cag get gaa agt gac ttt aaa ata ata 
Cys Gin Gly Pro Pro Ser Leu Gin Ala Glu Ser Asp Phe Lys lie lie 
680 685 690 



gaa ata gaa get cag tagtggtata taaaagagga agatgatatt ctccgccgtg 2234 
20 Glu lie Glu Ala Gin 
695 

gaatagcttc tgactctgtt gcattcaggg ggaaagccaa gaagatgtag agtcggccgt 2294 
ataact 2300 



<210> 16 
<211> 9 
<212> PRT 
3 0 <213> Chlamydia pneumoniae 



<220> 

<221> SITE 
<222> (50) . . (58) 



<400> 16 

Ser lie Leu Glu Leu Leu Gin Phe Val 
1 5 



<210> 17 

<211> 1900 

<212> DNA 

<213> Chlamydia pneumoniae 



<220> 
<221> CDS 

<222> (101) . . (1741) 



50 <400> 17 

tagcttgaaa tagcttcctc caattgtgat ttctgaagaa gtataggggg aaatgtcgaa 60 

gagatagtct tgttttaaag gaggagggga aaacggttta atg age aga aaa gac 115 

Met Ser Arg Lys Asp 
1 5 



60 



aat gag gtt tec tta get cgt tea att ttt aat ata tta tec gga act 
Asn Glu Val Ser Leu Ala Arg Ser lie Phe Asn lie Leu Ser Gly Thr 
10 15 20 



163 
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ttc tgt agt cgt att aca ggg ata ttt cga gaa att gca atg gca acc 211 
Phe Cys Ser Arg lie Thr Gly lie Phe Arg Glu lie Ala Met Ala Thr 
25 30 35 

tat ttt gga get gat cca att gta get get ttc tgg tta ggt ttc cgt 259 
Tyr Phe Gly Ala Asp Pro lie Val Ala Ala Phe Trp Leu Gly Phe Arg 
40 45 50 

act gtt ttt ttc tta aga aaa att tta gga ggg etc att eta gaa caa 307 
Thr Val Phe Phe Leu Arg Lys lie Leu Gly Gly Leu lie Leu Glu Gin 
55 60 65 

gec ttc ate cct cat ttt gaa ttt etc cgt get caa agt etc gat cgt 355 
Ala Phe lie Pro His Phe Glu Phe Leu Arg Ala Gin Ser Leu Asp Arg 
70 75 80 85 

gcg gcg ttt ttt ttc cga cgc ttt tct aga ttg att aaa ggc age act 403 
Ala Ala Phe Phe Phe Arg Arg Phe Ser Arg Leu lie Lys Gly Ser Thr 
90 95 100 

att ata ttc act ctg ctt att gaa gca gta ttg tgg gta ttc ttc aat 451 
lie lie Phe Thr Leu Leu lie Glu Ala Val Leu Trp Val Phe Phe Asn 
105 110 115 

aac gtt gaa gag ggg act tac gat atg att etc ctt act atg ata etc 499 
Asn Val Glu Glu Gly Thr Tyr Asp Met lie Leu Leu Thr Met lie Leu 
120 125 130 

ttg ccc tgt ggc att ttc tta atg atg tac aat gta aac ggc get ttg 547 
Leu Pro Cys Gly lie Phe Leu Met Met Tyr Asn Val Asn Gly Ala Leu 
135 140 145 

ctt cac tgt gga aat aag ttt ttc ggg gtg gga tta get ccc gta gtt 595 
Leu His Cys Gly Asn Lys Phe Phe Gly Val Gly Leu Ala Pro Val Val 
150 155 160 165 

gta aat ate att tgg att ttc ttt gtt ata gcg get cgt cat tea gat 643 
Val Asn lie lie Trp lie Phe Phe Val lie Ala Ala Arg His Ser Asp 
170 175 180 

cct aga gag cgt att ate ggt tta tec gtg get eta gtt ate ggg ttt 691 
Pro Arg Glu Arg lie lie Gly Leu Ser Val Ala Leu Val lie Gly Phe 
185 190 195 

ttc ttc gaa tgg tta ate acg gtt cct gga gta tgg aaa ttt eta tta 739 
Phe Phe Glu Trp Leu lie Thr Val Pro Gly Val Trp Lys Phe Leu Leu 
200 205 210 

gaa gcg aag age cca cct caa gaa cac gat agt gtt cga get tta tta 787 
Glu Ala Lys Ser Pro Pro Gin Glu His Asp Ser Val Arg Ala Leu Leu 
215 220 225 

get ccc tta tct ttg ggt att tta act tea age ate ttc cag ctg aac 835 
Ala Pro Leu Ser Leu Gly lie Leu Thr Ser Ser lie Phe Gin Leu Asn 
230 235 240 245 

ctt ctt tct gat ate tgc ttg get cgc tat gta cat gaa ata ggc cct 883 
Leu Leu Ser Asp lie Cys Leu Ala Arg Tyr Val His Glu lie Gly Pro 
250 255 260 
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eta tat ctt atg tac tec tta aag att tat cag etc ccc ata cat etc 
Leu Tyr Leu Met Tyr Ser Leu Lys lie Tyr Gin Leu Pro lie His Leu 
265 270 275 

ttt ggc ttt ggt gtg ttt acc gtt etc etc cca gca att tct cgt tgt 
Phe Gly Phe Gly Val Phe Thr Val Leu Leu Pro Ala lie Ser Arg Cys 
280 285 290 

gta cag cga gaa gat cat gag agg gga ttg aaa ctt atg aag ttc gtt 
Val Gin Arg Glu Asp His Glu Arg Gly Leu Lys Leu Met Lys Phe Val 
295 300 305 

etc acc eta acc atg tec gta atg ate att atg aca gca ggg eta ttg 
Leu Thr Leu Thr Met Ser Val Met lie lie Met Thr Ala Gly Leu Leu 
310 315 320 325 

etc tta get tta cct gga gtc cgt gtc ctt tat gaa cac gga ctt ttc 
Leu Leu Ala Leu Pro Gly Val Arg Val Leu Tyr Glu His Gly Leu Phe 
330 335 340 

cct cag agt get gtc tac get att gtt cgt gta ttg cga ggt tat ggt 
Pro Gin Ser Ala Val Tyr Ala lie Val Arg Val Leu Arg Gly Tyr Gly 
345 350 355 

gee agt att ate cct atg gee ttg get cct tta gtc tct gtt ctt ttt 
Ala Ser lie lie Pro Met Ala Leu Ala Pro Leu Val Ser Val Leu Phe 
3S0 365 370 

tat gca cag egg cag tat get gtt ccg etc ttt ata gga ate ggt acg 
Tyr Ala Gin Arg Gin Tyr Ala Val Pro Leu Phe lie Gly lie Gly Thr 
375 380 385 

get ttg gee aat att gtt tta age ttg gtt eta ggt cgt tgg gtt tta 
Ala Leu Ala Asn lie Val Leu Ser Leu Val Leu Gly Arg Trp Val Leu 
390 395 400 405 

aaa gac gtc teg ggc att tec tat get aca tec ata act get tgg gtg 
Lys Asp Val Ser Gly lie Ser Tyr Ala Thr Ser lie Thr Ala Trp Val 
410 415 420 

cag tta tat ttc etc tgg tat tat tct teg aaa aga etc cct atg tac 
Gin Leu Tyr Phe Leu Trp Tyr Tyr Ser Ser Lys Arg Leu Pro Met Tyr 
425 430 435 

tct aag tta ctt tgg gag age ate egg cgt tec ata aaa gtt atg gga 
Ser Lys Leu Leu Trp Glu Ser lie Arg Arg Ser lie Lys Val Met Gly 
440 445 450 

acc act atg ctt get tgt atg att act eta ggc tta aat ate ctt acg 
Thr Thr Met Leu Ala Cys Met lie Thr Leu Gly Leu Asn lie Leu Thr 
455 460 465 

caa act aca tat gta att ttc tta aac ccc etc aca cca ctt get tgg 
Gin Thr Thr Tyr Val lie Phe Leu Asn Pro Leu Thr Pro Leu Ala Trp 
470 475 480 485 

ccc tta tec tec ata acg get caa gca att get ttt tta tct gag age 
Pro Leu Ser Ser lie Thr Ala Gin Ala lie Ala Phe Leu Ser Glu Ser 
490 495 500 
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tgc att ttc ttg get ttt ttg ttt ggt ttt gca aaa ctg ctt cga gta 1651 
Cys lie Phe Leu Ala Phe Leu Phe Gly Phe Ala Lys Leu Leu Arg Val 
505 510 515 

gaa gat ctt att aat ttg get tct ttt gaa tac tgg cgt ggg caa egg 1699 
Glu Asp Leu lie Asn Leu Ala Ser Phe Glu Tyr Trp Arg Gly Gin Arg 
520 525 530 

ggt ctt ttg caa aga caa cac gtg atg caa gac act caa aat 1741 
Gly Leu Leu Gin Arg Gin His Val Met Gin Asp Thr Gin Asn 
535 540 545 

taatcatgtt tgtttcttgt agctcagtcg ctttctttta gctttaagtt ttgatagect 1801 
gcttggtctt ctgtttctac acttaatatt gatactaagg atactatgaa aaaacaggta 1861 
tatcaatggt tagcgagtgt ggttctttta gegctgaca 1900 



20 <210> 18 

<211> 9 

<212> PRT 

<213> Chlamydia pneumoniae 



<220> 

<221> SITE 

<222> (111) . . (119) 



<400> 18 

3 0 Val Leu Trp Val Phe Phe Asn Asn Val 
l 5 



<210> 19 
<211> 3150 
<212> DNA 

<213> Chlamydia pneumoniae 



<220> 
40 <221> CDS 

<222> (101) . . (2962) 



<400> 19 

tctcaagagt aaccttatcc ttagattatt cagctcaagt ctcctcgtca actgtaggtc 60 

aataccttaa agctgagagt cattgeacat tttaaccaca atg aaa aca tea agg 115 

Met Lys Thr Ser Arg 
1 5 



5 0 aat aaa cag tgc aaa ata aca gat ccc tta agt aaa tct tec ttc ttt 
Asn Lys Gin Cys Lys lie Thr Asp Pro Leu Ser Lys Ser Ser Phe Phe 



gtt gga gee tta att tta ggt aaa act aca ata etc ctt aat gcg act 
Val Gly Ala Leu lie Leu Gly Lys Thr Thr lie Leu Leu Asn Ala Thr 



ccg ttg tct gac tat ttt gat aat caa gca aat caa etc aca aca etc 
Pro Leu Ser Asp Tyr Phe Asp Asn Gin Ala Asn Gin Leu Thr Thr Leu 
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ttc cct 

Phe Pro 

55 

gca aca 

Ala Thr 
70 

10 

gat cac 

Asp His 



ggc ggt 
Gly Gly 



caa att 
20 Gin lie 



tat gtt 

Tyr Val 
135 

ttc tct 

Phe Ser 
150 

30 

tgt aca 

Cys Thr 



aat caa 
Asn Gin 



gga gca 
4 0 Gly Ala 



tgc ctg 
Cys Leu 
215 

ttc get 
Phe Ala 
230 

50 

ttg aat 
Leu Asn 



tgc att 
Cy s lie 



gec atg 
6 0 Ala Met 



eta att gat 
Leu lie Asp 



ctt ttt gga 
Leu Phe Gly 



cag aat tec 
Gin Asn Ser 
90 

get etc tct 
Ala Leu Ser 
105 

ctt ttc eta 
Leu Phe Leu 
120 

gat ggt aat 
Asp Gly Asn 



ggg aat tta 
Gly Asn Leu 



ggg gga get 
Gly Gly Ala 
170 

gga ace gca 
Gly Thr Ala 
185 

ate caa get 
lie Gin Ala 
200 

ttt ttt aat 
Phe Phe Asn 



aat get tgt 
Asn Ala Cys 



aac caa tea 
Asn Gin Ser 
250 

ctt aca aag 
Leu Thr Lys 
265 

gaa gcg gac 
Glu Ala Asp 
280 



act ctt act 
Thr Leu Thr 
60 

gtt agg gat 
Val Arg Asp 
75 

ata gaa age 
lie Glu Ser 



tgc aaa tea 
Cys Lys Ser 



aat age ttt 
Asn Ser Phe 
125 

ttc gat ctt 
Phe Asp Leu 
140 

age ttt cct 
Ser Phe Pro 
155 

gtt tta tgt 
Val Leu Cys 



tac ttc att 
Tyr Phe lie 



gca ate ata 
Ala lie lie 
205 

aat get gca 
Asn Ala Ala 
220 

aga att gag 
Arg lie Glu 
235 

ggt ctg ggt 
Gly Leu Gly 



aat acc ggt 
Asn Thr Gly 



ate tct get 
lie Ser Ala 
285 



aac atg act 
Asn Met Thr 



gac act aac 

Asp Thr Asn 
80 

tgg ttc gaa 

Trp Phe Glu 
95 

ctt gee ata 

Leu Ala lie 
110 

get att aaa 

Ala lie Lys 



tct gag aat 
Ser Glu Asn 



aat gca agt 
Asn Ala Ser 
160 

teg aaa aat 
Ser Lys Asn 
175 

aac aac aag 
Asn Asn Lys 
190 

aac att aag 
Asn lie Lys 



ggc gga aca 
Gly Gly Thr 



aat aat tct 
Asn Asn Ser 
240 

ggt gca ata 
Gly Ala lie 
255 

tct gtg ate 
Ser Val lie 
270 

aac cat tec 
Asn His Ser 



ccc tac tct 
Pro Tyr Ser 
65 

caa gac att 
Gin Asp lie 



aac ttc tct 
Asn Phe Ser 



acg aat aca 
Thr Asn Thr 
115 

aga get ggt 
Arg Ala Gly 
130 

cat ggt tec 
His Gly Ser 
145 

aat ttc get 
Asn Phe Ala 



gtt aca ate 
Val Thr lie 



gca aaa tct 
Ala Lys Ser 
195 

gac aac act 
Asp Asn Thr 
210 

gcg ggg ggc 
Ala Gly Gly 
225 

cag cct ate 
Gin Pro lie 



aga gta cat 
Arg Val His 



ttc aac aat 
Phe Asn Asn 
275 

tct gga ggg 
Ser Gly Gly 
290 



cat aga 307 
His Arg 



gtc etc 355 
Val Leu 
85 

caa gac 403 

Gin Asp 

100 

aaa aac 451 
Lys Asn 



gcg atg 499 
Ala Met 



ate att 547 
lie lie 



gat act 595 
Asp Thr 
165 

tea aaa 643 

Ser Lys 

180 

tea gga 691 
Ser Gly 



ggc cct 739 
Gly Pro 



gcg ttg 787 
Ala Leu 



tat ttt 835 
Tyr Phe 
245 

caa gag 883 

Gin Glu 

260 

aat ttt 931 
Asn Phe 



get ate 979 
Ala lie 
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tat tgc att agt tgt tct ata aaa gac aac cca gga att gca gcc ttc 
Tyr Cys lie Ser Cys Ser lie Lys Asp Asn Pro Gly lie Ala Ala Phe 
295 300 305 

gat aat aat act gca gca cga gat gga ggt get ate tgt aca caa tct 
Asp Asn Asn Thr Ala Ala Arg Asp Gly Gly Ala lie Cys Thr Gin Ser 
310 315 320 325 

eta act ata caa gac agt ggt ccc gtc tat ttc aca aac aat cag gga 
Leu Thr lie Gin Asp Ser Gly Pro Val Tyr Phe Thr Asn Asn Gin Gly 
330 335 340 

act tgg ggc ggc get ate atg etc cgt caa gat ggt gca tgc act tta 
Thr Trp Gly Gly Ala lie Met Leu Arg Gin Asp Gly Ala Cys Thr Leu 
345 350 355 

ttt get gat cag gga gat att att ttt tat aat aat aga cac ttc aaa 
Phe Ala Asp Gin Gly Asp lie lie Phe Tyr Asn Asn Arg His Phe Lys 
360 365 370 

gat act ttc age aat cat gtt tct gta aac tgc acg cgt aat gtc tea 
Asp Thr Phe Ser Asn His Val Ser Val Asn Cys Thr Arg Asn Val Ser 
375 380 385 

tta aca gtt gga gca agt caa ggt cat tct get acc ttc tat gat ccc 
Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala Thr Phe Tyr Asp Pro 
390 395 400 405 

ata eta caa aga tat act ata caa aac tct ate caa aaa ttt aat cct 
lie Leu Gin Arg Tyr Thr lie Gin Asn Ser lie Gin Lys Phe Asn Pro 
410 415 420 

aat cca gaa cac etc gga act ate ttg ttc tec tea aca tat att ccg 
Asn Pro Glu His Leu Gly Thr lie Leu Phe Ser Ser Thr Tyr lie Pro 
425 430 435 

gat aca teg act tct cgt gat gac ttc att tea cat ttc aga aac cac 
Asp Thr Ser Thr Ser Arg Asp Asp Phe lie Ser His Phe Arg Asn His 
440 445 450 

att gga ctg tac aac ggc aca etc get ctt gaa gat cga gca gag tgg 
lie Gly Leu Tyr Asn Gly Thr Leu Ala Leu Glu Asp Arg Ala Glu Trp 
455 460 465 

aaa gtc tat aaa ttt gat caa ttt ggt ggg act eta egg tta ggc agt 
Lys Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr Leu Arg Leu Gly Ser 
470 475 480 485 

aga get gtg ttt tct aca aca gac gaa gaa caa agt age agt agt gtg 
Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin Ser Ser Ser Ser Val 
490 495 500 

ggt tct gta att aac ate aat aat ctt gca att aac ctt ccc tct ate 
Gly Ser Val lie Asn lie Asn Asn Leu Ala lie Asn Leu Pro Ser lie 
505 510 515 

tta ggc aac aga gtt get ccc aag eta tgg att cgc ccc aca ggt tea 
Leu Gly Asn Arg Val Ala Pro Lys Leu Trp lie Arg Pro Thr Gly Ser 
520 525 530 
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tea gca ccc tat age gaa gat aat aac cct ata ate aat etc tea gga 
Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro lie He Asn Leu Ser Gly 
535 540 545 

cct ttg age eta ctg gat gac gag aac eta gat ccc tat gat act gca 
Pro Leu Ser Leu Leu Asp Asp Glu Asn Leu Asp Pro Tyr Asp Thr Ala 
550 555 560 565 

gac ctt gee caa cct ate gca gaa gtt cct ctt ctg tat etc tta gac 
Asp Leu Ala Gin Pro He Ala Glu Val Pro Leu Leu Tyr Leu Leu Asp 
570 575 580 

gtc aca get aaa cat att aat acg gat aat ttc tac cct gag ggt eta 
Val Thr Ala Lys His He Asn Thr Asp Asn Phe Tyr Pro Glu Gly Leu 
585 590 595 

aat aca act caa cac tac ggc tac caa ggc gtt tgg tec cct tac tgg 
Asn Thr Thr Gin His Tyr Gly Tyr Gin Gly Val Trp Ser Pro Tyr Trp 
600 605 610 

ate gaa aca ate aca act tct gat acc tct tct gaa gat act gtg aat 
He Glu Thr He Thr Thr Ser Asp Thr Ser Ser Glu Asp Thr Val Asn 
615 620 625 

act tta cat cgc cag ctt tat ggt gat tgg aca cct aca gga tat aag 
Thr Leu His Arg Gin Leu Tyr Gly Asp Trp Thr Pro Thr Gly Tyr Lys 
630 635 640 645 

gta aac cca gaa aac aaa gga gac att gec eta tct gec ttc tgg caa 
Val Asn Pro Glu Asn Lys Gly Asp He Ala Leu Ser Ala Phe Trp Gin 
650 655 660 

tct ttc cat aac tta ttt gcg aca eta cgt tat caa aca cag caa ggc 
Ser Phe His Asn Leu Phe Ala Thr Leu Arg Tyr Gin Thr Gin Gin Gly 
665 670 675 

caa ata gca cct aca get tct gga gaa get act cga etc ttc gtg cat 
Gin He Ala Pro Thr Ala Ser Gly Glu Ala Thr Arg Leu Phe Val His 
680 685 690 

caa aat age aac aat gat gcg aaa gga ttc cat atg gaa get acg ggt 
Gin Asn Ser Asn Asn Asp Ala Lys Gly Phe His Met Glu Ala Thr Gly 
695 700 705 

tat tct ttg gga aca acc tea aac act get tct aat cat age ttt ggt 
Tyr Ser Leu Gly Thr Thr Ser Asn Thr Ala Ser Asn His Ser Phe Gly 
710 715 720 725 

gta aac ttc tec caa ctt ttc agt aat etc tac gag age cac tec gac 
Val Asn Phe Ser Gin Leu Phe Ser Asn Leu Tyr Glu Ser His Ser Asp 
730 735 740 

aat tec gtg get teg cat acg aca act gta gcg etc cag ate aat aat 
Asn Ser Val Ala Ser His Thr Thr Thr Val Ala Leu Gin lie Asn Asn 
745 750 755 

cct tgg ctg caa gag aga ttc tct aca tct gca tct eta gec tac age 
Pro Trp Leu Gin Glu Arg Phe Ser Thr Ser Ala Ser Leu Ala Tyr Ser 
760 765 770 
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tac age aac cac cat ate aaa gca tct gga tat tct gga aaa ata caa 2467 

Tyr Ser Asn His His lie Lys Ala Ser Gly Tyr Ser Gly Lys lie Gin 

775 780 785 

acg gaa ggc aaa tgt tat agt acg aca tta ggg gcg get etc tct tgc 2515 

Thr Glu Gly Lys Cys Tyr Ser Thr Thr Leu Gly Ala Ala Leu Ser Cys 

790 795 800 805 

tct eta tct eta caa tgg cga tea cga cct etc cac ttc act cct ttt 2563 
Ser Leu Ser Leu Gin Trp Arg Ser Arg Pro Leu His Phe Thr Pro Phe 

810 815 820 

ate caa gca att gec gtt cgt tct aat caa act gcg ttt caa gaa agt 2611 

He Gin Ala He Ala Val Arg Ser Asn Gin Thr Ala Phe Gin Glu Ser 

825 830 835 

gga gat aaa get aga aaa ttt tct gtt cat aaa ccc tta tat aac ctg 2 659 

Gly Asp Lys Ala Arg Lys Phe Ser Val His Lys Pro Leu Tyr Asn Leu 

840 845 850 

aca gtc cct ctg gga att cag age get tgg gaa tec aag ttc cgt ctt 2707 

Thr Val Pro Leu Gly lie Gin Ser Ala Trp Glu Ser Lys Phe Arg Leu 

855 860 865 

cct ace tat tgg aac ata gag ctt get tat cag cct gtc etc tac caa 2755 

Pro Thr Tyr Trp Asn lie Glu Leu Ala Tyr Gin Pro Val Leu Tyr Gin 

870 875 880 885 

caa aat cct gag ate aac gtg agt eta gaa tct agt gga teg tea tgg 2 803 

Gin Asn Pro Glu lie Asn Val Ser Leu Glu Ser Ser Gly Ser Ser Trp 

890 895 900 

etc eta tea gga ace ace ctt get cgc aat gec att get ttt aaa gga 2851 

Leu Leu Ser Gly Thr Thr Leu Ala Arg Asn Ala lie Ala Phe Lys Gly 

905 910 915 

aga aac caa att ttt ate ttc cct aaa ctt teg gtg ttc tta gac tat 2899 

Arg Asn Gin He Phe He Phe Pro Lys Leu Ser Val Phe Leu Asp Tyr 

920 925 930 

caa ggc teg gta tec tea tea acg acg aca cat tac ctt cac gca gga 2947 

Gin Gly Ser Val Ser Ser Ser Thr Thr Thr His Tyr Leu His Ala Gly 

935 940 945 

acg acc ttt aag ttt taaaagcatg ttatatagac aatgeaaect gtaaagacca 3002 

Thr Thr Phe Lys Phe 

950 

aatagagagt agtgaacact ctctaccatc atgaatctta tgggagaagc taagggaaat 3062 
ccacagatac gtttccccca taaaaattaa gaacccgata catcctcact agagattcga 3122 

aagaactact taaatcctaa gcattcga 3150 



<210> 20 
<211> 9 
<212> PRT 

<213> Chlamydia pneumoniae 



<220> 
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<221> SITE 

<222> (119) . . (127) 



<400> 20 

lie Leu Phe Leu Asn Ser Phe Ala lie 
1 5 



<210> 21 
<211> 3200 
<212> DNA 

<213> Chlamydia pneumoniae 



<220> 
<221> CDS 

<222> (101) . . (3100) 



20 <400> 21 

tcctgaactc cactcgaaat 
tgtgacctac aatgctttag 
cactctctcc ttagattact 
cgtggcgagt agaatgagat 
tctagtgtca aagactcctc 
gttcgggatg actcctgcag 
agagagggat gaagagtttc 
aggattttct ccaataacta 
actttctaac tacaagtcta 

3 0 tgtgtcctgt aataatttct 
tctcgcgatt gggactggag 
tagaggaccc cttatttttt 
aactcgtggg ggtgcgattg 
tttctacttt gtcaacaatt 
ctgccgcatc caaagcaaca 
a 99999t9 c 9 cttcgtagtg 
taagaacaac tgtgggaaca 
aaataactcc gggtcggtga 
aggaaatggt tcaggagggg 

40 aactattctt ttcaataata 
tttgacaatc aaaaatagtg 
tgctcttatg ctcctacagg 
atttcaaaat aatgaggttt 
accaaatagc aacttacaac 
tatagaacac caacatccaa 
gggaacgatc ttattttctt 
cattagcagc tcgaaaaata 
tgcgggatgg caattctata 
ggcgagtatt gcaacaactg 

50 catcattaat aaccttgcga 
cttgtggatc cgtcctctac 
tactttatca ggtcctctga 
agatctctct gagcctttac 
tcatatcaat accgataact 
tcaaggcatc tggtctcctt 
agagacggca aacaccctct 
ggtcaatcct gaataccaag 
tatgttctct ctattaagaa 
cttagaaatt caagggattg 

6 0 tccaggattc cgtatccaat 
acatcagaaa atctccttag 



tactgattag ccaaggtacg 
ggatcaaagt gaaaaatacc 
ctgcggatat ttcttcctcc 
ttttaacaat aagtgaccaa 
ctaagttttt attctatctc 
tgtatagttt acaaacggac 
gtacgagctt tcctctctta 
cgtttgttgg aaatagacat 
ttgataacat ccttcttctt 
tattatcaaa tgttgaagac 
gcgcgattgc ttgccaggga 
tcagcaatcg aggtcttaac 
cctgtaatgg agacttcacg 
ccgtcaacaa ctggggagga 
gggcacctct actctttttt 
aaaatacaac gatctctgat 
atggcggggc cattcaaaca 
ttttcaataa caacacagcg 
cgatttatac aacaaaccta 
actactgcat tcgcgatggc 
gccacgtata tttcaccaac 
acagcacctg cctactcttc 
tcctcaccac atttggtaga 
ttggagctaa taaggggtat 
ctacaaatcc tctaatcttt 
cagcctatat cccagaagct 
cctctgaact tcgcaatggt 
agttcactca aaaaggaggt 
ccaactctga gactccatca 
ttaacctccc ctcgatctta 
aatctagtgc tcctttcaca 
cactcttaaa tgaggaaaac 
aaaacattca tcttctttct 
ttcatcctga aagcttaaat 
attgggtaga gacgataaca 
acagagctct gtatgccaat 
gagatcttgc tacgactccc 
gttataatcg aactggtgat 
ccgacggcct ctttgttcat 
ctacagggta ttccttacaa 
gttttgcaca gttcttcacc 



tggacgacgc aggccactcc 60 
atgcaggtgt ttcctaaagt 12 0 
acgctgagtc actacttaaa 180 
aacagaaaga ttaaggaacc 240 
gggaatttca cagcctgcat 300 
tcccttgaaa agtttgcttt 360 
gactctctct ccactcttac 420 
aattcctctc aagacattgt 480 
tggacatcgg ctgggggagc 540 
catgccttct tcagtaaaaa 600 
gcctgcacaa tcacgaagaa 660 
aatgcgagta caggaggaga 72 0 
atttctcaaa atcaagggac 780 
gccctctcca ccaatggaca 840 
aacaatacag cccctagtgg 900 
aacacgcgtc ctatttattt 960 
agcgttactg ttgcgataaa 1020 
ttatctggtt cgataaattc 1080 
tccatagacg ataaccctgg 1140 
ggagctatct gtacacaatt 1200 
aatcaaggaa actggggagg 12 6 0 
gcggaacaag gaaatatcgc 132 0 
tacaacgcca tacattgtac 1380 
acgactgctt tttttgatcc 1440 
aatcccaatg cgaaccatca 1500 
tctgactacg aaaataattt 1560 
gtcctctcta tcgaggatcg 1620 
atccttaaat tagggcatgc 1680 
actagtgtag gctcccaggt 1740 
gcaaaaggaa aagctcctac 1800 
gaggacaata accctacaat 1860 
cgcgatccct acgacagtat 1920 
ttatcggatg taacagcacg 1980 
gcgactgagc attacggtta 2040 
acaacaaata acgcttctat 2100 
tggactccct taggatataa 2160 
ctatggcaat cctttcatac 2220 
tctgatatcg agaggccttt 2280 
caaaatagca tccccggggc 2340 
gcatcctccg aaacttcttt 2400 
cgcactaaag aaatcggatc 2460 
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aagcaacaac gtctcggctc acaatacagt ctcttcactt tatgttgagc ttccgtggtt 2520 

ccaagaggcc tttgcaacat cccacagttt agcgtatggc tatggggacc atcacctcca 258 0 

cgcctacatc cgtcacatca agaacagggc agaagggacg tgttatagcc atacattagc 264 0 

agcagctatc ggctgttctt tcccttggca acagaaatcc tatcttcacc tcagcccgtt 2700 

cgttcaggca attgcaatac gttctcacca aacagcgttc gaagagattg gtgacaatcc 2760 

ccgaaagttt gtctctcaaa agcctttcta taatctgacc ttacctctag gaatccaagg 2820 

aaaatggcag tcaaaattcc acgtacctac agaatggact ctagaacttt cttaccaacc 2880 

ggtactctat caacaaaatc cccaaatcgg tgtcacgcta cttgcgagcg gaggttcctg 2940 

ggatatccta ggccataact atgttcgcaa tgctttaggg tacaaagtcc acaatcaaac 3000 

tgcgctcttc cgttctctcg atctattctt ggattaccaa ggatcggtct cctcctcgac 3060 

atctacgcac catctccaag caggaagtac cttaaaattc taaaataaaa gaacgataaa 312 0 

attgaaatct ttagaattaa caactatccg atgagctacg ttagcccaat cggtagagga 3180 

ctccctcaaa atttaaataa 3200 



<210> 22 

<211> 9 

<212> PRT 

<213> Chlamydia pneumoniae 
<220> 

<221> SITE 

<222> (56) . . (64) 

<400> 22 

Phe Leu Phe Tyr Leu Gly Asn Phe Thr 



<210> 23 
<211> 3000 
<212> DNA 

<213> Chlamydia pneumoniae 



<220> 
<221> CDS 

<222> (101) . . (2893) 
<400> 23 

tagacactat aaaacaaatt atagacaaaa aatctagcat tgatttattc agaatatttc 60 

tttctatttg tgaacgagta tgcgcttttt ttgcttcgga atg ttg ctt cct ttt 115 

Met Leu Leu Pro Phe 



act ttt gta ttg get aat gaa ggt etc caa ctt cct ttg gag acc tat 163 

Thr Phe Val Leu Ala Asn Glu Gly Leu Gin Leu Pro Leu Glu Thr Tyr 

10 15 20 

att aca tta agt cct gaa tat caa gca gec cct caa gta ggg ttt act 211 

lie Thr Leu Ser Pro Glu Tyr Gin Ala Ala Pro Gin Val Gly Phe Thr 

25 30 35 

cat aac caa aat caa gat etc gca att gtc ggg aat cac aat gat ttc 259 

His Asn Gin Asn Gin Asp Leu Ala lie Val Gly Asn His Asn Asp Phe 



35/83 



ate ttg gac tat aag tac tat egg teg aat gga ggt get ctt ace tgt 
lie Leu Asp Tyr Lys Tyr Tyr Arg Ser Asn Gly Gly Ala Leu Thr Cys 



aag aat ctt ctg ate tct gaa aat ata ggg aat gtc ttc ttt gag aag 
Lys Asn Leu Leu lie Ser Glu Asn lie Gly Asn Val Phe Phe Glu Lys 



aat gtc tgt ccc aat tct ggc ggg gca att tat get get caa aat tgc 

Asn Val Cys Pro Asn Ser Gly Gly Ala lie Tyr Ala Ala Gin Asn Cys 

90 95 100 

acg ate tec aag aat cag aac tat gca ttt act aca aac ttg gtc tct 

Thr lie Ser Lys Asn Gin Asn Tyr Ala Phe Thr Thr Asn Leu Val Ser 

105 110 115 

gac aat cct aca gee act gcg gga tea eta ttg ggt gga get etc ttt 

Asp Asn Pro Thr Ala Thr Ala Gly Ser Leu Leu Gly Gly Ala Leu Phe 

120 125 130 

gee ata aat tgc tct att act aat aac eta gga cag gga act ttc gtt 

Ala lie Asn Cys Ser lie Thr Asn Asn Leu Gly Gin Gly Thr Phe Val 

135 140 145 

gac aat etc get tta aat aag ggg ggt gee etc tat act gag acg aac 

Asp Asn Leu Ala Leu Asn Lys Gly Gly Ala Leu Tyr Thr Glu Thr Asn 

150 155 160 165 

tta tct att aaa gac aat aaa ggc ccg ate ata ate aag cag aat egg 

Leu Ser lie Lys Asp Asn Lys Gly Pro lie lie lie Lys Gin Asn Arg 

170 175 180 

gca eta aat teg gac agt tta gga gga ggg att tat agt ggg aac tct 

Ala Leu Asn Ser Asp Ser Leu Gly Gly Gly lie Tyr Ser Gly Asn Ser 

185 190 195 

eta aat ata gag gga aat tct gga get ata cag ate aca age aac tct 

Leu Asn lie Glu Gly Asn Ser Gly Ala lie Gin lie Thr Ser Asn Ser 

200 205 210 

tea gga tct ggg gga ggc ata ttt tct acc caa aca etc acg ate tec 

Ser Gly Ser Gly Gly Gly lie Phe Ser Thr Gin Thr Leu Thr lie Ser 

215 220 225 

teg aat aaa aaa etc ata gaa ate agt gaa aat tec gcg ttc gca aat 

Ser Asn Lys Lys Leu lie Glu lie Ser Glu Asn Ser Ala Phe Ala Asn 

230 235 240 245 

aac tat gga teg aac ttc aat cca gga gga gga ggt ctt act acc acc 

Asn Tyr Gly Ser Asn Phe Asn Pro Gly Gly Gly Gly Leu Thr Thr Thr 

250 255 260 

ttt tgc acg ata ttg aac aac cga gaa ggg gta etc ttt aac aat aac 

Phe Cys Thr lie Leu Asn Asn Arg Glu Gly Val Leu Phe Asn Asn Asn 

265 270 275 

caa age cag age aac ggt gga gee att cat gcg aaa tct ate att ate 

Gin Ser Gin Ser Asn Gly Gly Ala lie His Ala Lys Ser lie lie lie 

280 285 290 
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aaa gaa aat ggt cct gta tac ttt tta aat aac act gca act egg gga 
Lys Glu Asn Gly Pro Val Tyr Phe Leu Asn Asn Thr Ala Thr Arg Gly 

295 300 305 

ggg get etc etc aac tta tea gca ggt tct gga aac gga age ttc ate 
Gly Ala Leu Leu Asn Leu Ser Ala Gly Ser Gly Asn Gly Ser Phe lie 

310 315 320 325 

tta tct gca gat aat gga gat att ate ttt aac aat aat acg gee tec 

Leu Ser Ala Asp Asn Gly Asp lie lie Phe Asn Asn Asn Thr Ala Ser 

330 335 340 

aag cat gee etc aat cct cca tac aga aac gee att cac teg act cct 

Lys His Ala Leu Asn Pro Pro Tyr Arg Asn Ala lie His Ser Thr Pro 

345 350 355 

aat atg aat ctg caa ata gga gee cgt ccc ggc tat cga gtg ctg ttc 
Asn Met Asn Leu Gin lie Gly Ala Arg Pro Gly Tyr Arg Val Leu Phe 

360 365 370 

tat gat ccc ata gaa cat gag etc cct tec tec ttc ccc ata etc ttt 

Tyr Asp Pro lie Glu His Glu Leu Pro Ser Ser Phe Pro lie Leu Phe 

375 380 385 

aat ttc gaa ace ggt cat aca ggt aca gtt tta ttt tea ggg gaa cat 

Asn Phe Glu Thr Gly His Thr Gly Thr Val Leu Phe Ser Gly Glu His 

390 395 400 405 

gta cac cag aac ttt ace gat gaa atg aat ttc ttt tec tat tta agg 

Val His Gin Asn Phe Thr Asp Glu Met Asn Phe Phe Ser Tyr Leu Arg 

410 415 420 

aac act teg gaa eta cgt caa gga gtc ctt get gtt gaa gat ggt gcg 

Asn Thr Ser Glu Leu Arg Gin Gly Val Leu Ala Val Glu Asp Gly Ala 

425 430 435 

ggg ctg gee tgc tat aag ttc ttc caa cga gga ggc act eta ctt eta 

Gly Leu Ala Cys Tyr Lys Phe Phe Gin Arg Gly Gly Thr Leu Leu Leu 

440 445 450 

ggt caa ggt gcg gtg ate acg aca gca gga acg att ccc aca cca tec 

Gly Gin Gly Ala Val lie Thr Thr Ala Gly Thr lie Pro Thr Pro Ser 

455 460 465 

tea aca cca acg aca gta gga agt act ata act tta aat cac att gec 

Ser Thr Pro Thr Thr Val Gly Ser Thr lie Thr Leu Asn His lie Ala 

470 475 480 485 

att gac ctt cct tct att ctt tct ttt caa get cag get cca aaa att 

lie Asp Leu Pro Ser lie Leu Ser Phe Gin Ala Gin Ala Pro Lys lie 

490 495 500 

tgg att tac ccc aca aaa aca gga tct ace tat act gaa gat tec aac 

Trp lie Tyr Pro Thr Lys Thr Gly Ser Thr Tyr Thr Glu Asp Ser Asn 

505 510 515 

ccg aca ate aca ate tea gga act etc ace tta cgc aac age aac aac 

Pro Thr lie Thr lie Ser Gly Thr Leu Thr Leu Arg Asn Ser Asn Asn 

520 525 530 
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gaa gat ccc tac gat agt ctg gat etc teg cac tct ctt gag aaa gtt 
Glu Asp Pro Tyr Asp Ser Leu Asp Leu Ser His Ser Leu Glu Lys Val 
535 540 545 

ccc ctt ctt tat att gtc gat gtc get gca caa aaa att aac tct teg 
Pro Leu Leu Tyr lie Val Asp Val Ala Ala Gin Lys lie Asn Ser Ser 
550 555 560 565 

caa ctg gat eta tec aca tta aat tct ggc gaa cac tat ggg tat caa 
Gin Leu Asp Leu Ser Thr Leu Asn Ser Gly Glu His Tyr Gly Tyr Gin 
570 575 580 

ggc ate tgg teg acc tat tgg gta gaa act aca aca ate acg aac cct 
Gly lie Trp Ser Thr Tyr Trp Val Glu Thr Thr Thr lie Thr Asn Pro 
585 590 595 

aca tct eta eta ggc gcg aat aca aaa cac aag ctg etc tat gca aac 
Thr Ser Leu Leu Gly Ala Asn Thr Lys His Lys Leu Leu Tyr Ala Asn 
600 605 610 

tgg tct cct eta ggc tac cgt cct cat ccc gaa cgt cga gga gaa ttc 
Trp Ser Pro Leu Gly Tyr Arg Pro His Pro Glu Arg Arg Gly Glu Phe 
615 620 625 

att acg aat gec ttg tgg caa teg gca tat acg get ctt gca gga etc 
lie Thr Asn Ala Leu Trp Gin Ser Ala Tyr Thr Ala Leu Ala Gly Leu 
630 635 640 645 

cac tec etc tec tec tgg gat gaa gag aag ggt cat gca get tec eta 
His Ser Leu Ser Ser Trp Asp Glu Glu Lys Gly His Ala Ala Ser Leu 
650 655 660 

caa ggc att ggt ctt ctg gtt cat caa aaa gac aaa aac ggt ttt aag 
Gin Gly lie Gly Leu Leu Val His Gin Lys Asp Lys Asn Gly Phe Lys 
665 670 675 

gga ttt cgt agt cat atg aca ggt tat agt get acc acc gaa gca acc 
Gly Phe Arg Ser His Met Thr Gly Tyr Ser Ala Thr Thr Glu Ala Thr 
680 685 690 

tct tct caa agt ccg aat ttc tct tta gga ttt get cag ttc ttc tec 
Ser Ser Gin Ser Pro Asn Phe Ser Leu Gly Phe Ala Gin Phe Phe Ser 
695 700 705 

aaa get aaa gaa cat gaa tct caa aat age acg tec tct cac cac tat 
Lys Ala Lys Glu His Glu Ser Gin Asn Ser Thr Ser Ser His His Tyr 
710 715 720 725 

ttc tct gga atg tgc ata gca aaa tac tct ctt caa aga gtg ata cgt 
Phe Ser Gly Met Cys lie Ala Lys Tyr Ser Leu Gin Arg Val lie Arg 
730 735 740 

eta tct gtg tct ctt get tat atg ttt acc teg gaa cat acc cat aca 
Leu Ser Val Ser Leu Ala Tyr Met Phe Thr Ser Glu His Thr His Thr 
745 750 755 

atg tat cag ggt etc ctg gaa ggg aac tct cag gga tct ttc cac aac 
Met Tyr Gin Gly Leu Leu Glu Gly Asn Ser Gin Gly Ser Phe His Asn 
760 765 770 
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cat acc tta gca ggg get etc tec tgt gtt ttc tta cct caa cct cac 2467 
His Thr Leu Ala Gly Ala Leu Ser Cys Val Phe Leu Pro Gin Pro His 

775 780 785 

ggc gag tec ctg cag ate tat ccc ttt att act gee tta gee ate cga 2515 

Gly Glu Ser Leu Gin lie Tyr Pro Phe lie Thr Ala Leu Ala lie Arg 
790 795 800 805 

gga aat ctt get gcg ttt caa gaa tct gga gac cat get egg gaa ttt 2563 

Gly Asn Leu Ala Ala Phe Gin Glu Ser Gly Asp His Ala Arg Glu Phe 
810 815 820 

tec eta cac cgc ccc eta acg gac gtc tec etc cct gta gga ate cgc 2611 

Ser Leu His Arg Pro Leu Thr Asp Val Ser Leu Pro Val Gly lie Arg 
825 830 835 

get tct tgg aag aac cac cac cga gtt ccc eta gtc tgg etc aca gaa 2659 

Ala Ser Trp Lys Asn His His Arg Val Pro Leu Val Trp Leu Thr Glu 
840 845 850 

att tec tat cgc tct act etc tat agg caa gat cct gaa etc cac teg 2707 

lie Ser Tyr Arg Ser Thr Leu Tyr Arg Gin Asp Pro Glu Leu His Ser 

855 860 865 

aaa tta ctg att age caa ggt acg tgg acg acg cag gec act cct gtg 2755 

Lys Leu Leu lie Ser Gin Gly Thr Trp Thr Thr Gin Ala Thr Pro Val 
870 875 880 885 

acc tac aat get tta ggg ate aaa gtg aaa aat acc atg cag gtg ttt 2803 

Thr Tyr Asn Ala Leu Gly lie Lys Val Lys Asn Thr Met Gin Val Phe 
890 895 900 

cct aaa gtc act etc tec tta gat tac tct gcg gat att tct tec tec 2851 

Pro Lys Val Thr Leu Ser Leu Asp Tyr Ser Ala Asp lie Ser Ser Ser 
905 910 915 

acg ctg agt cac tac tta aac gtg gcg agt aga atg aga ttt 2893 

Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met Arg Phe 
920 925 930 

taacaataag tgaccaaaac agaaagatta aggaacctct agtgtcaaag actcctccta 2953 
agtttttatt etatcteggg aatttcacag cctgcatgtt egggatg 3000 



<210> 24 

<211> 9 

<212> PRT 

5 0 <213> Chlamydia pneumoniae 

<220> 

<221> SITE 

<222> (1) . . (9) 

<400> 24 

Met Leu Leu Pro Phe Thr Phe Val Leu 



60 



opto: 
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<210> 25 
<211> 3150 
<212> DNA 

<213> Chlamydia pneumoniae 

<220> 
<221> CDS 

<222> (100) . . (3033) 
<400> 25 

cactgtggat gtgatattcg cagaacctcc cgtcaaatat actctagata taggaagcaa 60 

attacgattt taaaccttat ttaacgacag ggttgaggc atg cct ctt tct ttc 114 

Met Pro Leu Ser Phe 



aaa tct tea tct ttt tgt eta ctt gec tgt tta tgt agt gca agt tgc 
Lys Ser Ser Ser Phe Cys Leu Leu Ala Cys Leu Cys Ser Ala Ser Cys 



gcg ttt get gag act aga etc gga ggg aac ttt gtt cct cca att acg 
Ala Phe Ala Glu Thr Arg Leu Gly Gly Asn Phe Val Pro Pro lie Thr 



aat cag ggt gaa gag ate tta etc act tea gat ttt gtt tgt tea aac 
Asn Gin Gly Glu Glu lie Leu Leu Thr Ser Asp Phe Val Cys Ser Asn 



ttc ttg ggg gcg agt ttt tea agt tec ttt ate aat agt tec age aat 
Phe Leu Gly Ala Ser Phe Ser Ser Ser Phe lie Asn Ser Ser Ser Asn 



etc tec tta tta ggg aag ggc ctt tec tta acg ttt ace tct tgt caa 
Leu Ser Leu Leu Gly Lys Gly Leu Ser Leu Thr Phe Thr Ser Cys Gin 



get cct aca aat agt aac tat gcg eta ctt tct gec gca gag act ctg 

Ala Pro Thr Asn Ser Asn Tyr Ala Leu Leu Ser Ala Ala Glu Thr Leu 

90 95 100 

ace ttc aag aat ttt tct tct ata aac ttt aca ggg aac caa teg aca 

Thr Phe Lys Asn Phe Ser Ser lie Asn Phe Thr Gly Asn Gin Ser Thr 

105 110 115 

gga ctt ggc ggc etc ate tac gga aaa gat att gtt ttc caa tct ate 

Gly Leu Gly Gly Leu lie Tyr Gly Lys Asp lie Val Phe Gin Ser lie 

120 125 130 

aaa gat ttg ate ttc act acg aac cgt gtt gee tat tct cca gca tct 

Lys Asp. Leu lie Phe Thr Thr Asn Arg Val Ala Tyr Ser Pro Ala Ser 

135 140 145 

gta act acg teg gca act ccc gca ate act aca gta act aca gga gee 

Val Thr Thr Ser Ala Thr Pro Ala lie Thr Thr Val Thr Thr Gly Ala 

150 155 160 165 



60 



tct get etc caa cct aca gac tea etc act gtc gaa aac ata tec caa 
Ser Ala Leu Gin Pro Thr Asp Ser Leu Thr Val Glu Asn lie Ser Gin 
170 175 180 



642 
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teg ate aag ttt ttt ggg aac ctt gec aac ttc ggc tct gca att age 
Ser lie Lys Phe Phe Gly Asn Leu Ala Asn Phe Gly Ser Ala lie Ser 
185 190 195 

agt tct ccc acg gca gtc gtt aaa ttc ate aat aac acc get ace atg 
Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn Asn Thr Ala Thr Met 
200 205 210 

age ttc tec cat aac ttt act teg tea gga ggc ggc gtg att tat gga 
Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly Gly Val lie Tyr Gly 
215 220 225 

gga age tct etc ctt ttt gaa aac aat tct gga tgc ate ate ttc acc 
Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly Cys lie lie Phe Thr 
230 235 240 245 

gec aac tec tgt gtg aac age tta aaa ggc gtc acc cct tea tea gga 
Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val Thr Pro Ser Ser Gly 
250 255 260 

acc tat get tta gga agt ggc gga gec ate tgc ate cct acg gga act 
Thr Tyr Ala Leu Gly Ser Gly Gly Ala lie Cys lie Pro Thr Gly Thr 
265 270 275 

ttc gaa tta aaa aac aat cag ggg aag tgc acc ttc tct tat aat ggt 
Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr Phe Ser Tyr Asn Gly 
280 285 290 

aca cca aat gat gcg ggt gcg ate tac gec gaa acc tgc aac ate gta 
Thr Pro Asn Asp Ala Gly Ala lie Tyr Ala Glu Thr Cys Asn lie Val 
295 300 305 

ggg aac cag ggt gee ttg etc eta gat age aac act gca gcg aga aat 
Gly Asn Gin Gly Ala Leu Leu Leu Asp Ser Asn Thr Ala Ala Arg Asn 
310 315 320 325 

ggc gga gee ate tgt get aaa gtg etc aat att caa gga cgc ggt cct 
Gly Gly Ala lie Cys Ala Lys Val Leu Asn lie Gin Gly Arg Gly Pro 
330 335 340 

att gaa ttc tct aga aac cgc gcg gag aag ggt gga get att ttc ata 
He Glu Phe Ser Arg Asn Arg Ala Glu Lys Gly Gly Ala He Phe He 
345 350 355 

ggc ccc tct gtt gga gac cct gcg aag caa aca teg aca ctt acg att 
Gly Pro Ser Val Gly Asp Pro Ala Lys Gin Thr Ser Thr Leu Thr lie 
360 365 370 

ttg get tec gaa ggt gat att gcg ttc caa gga aac atg etc aat aca 
Leu Ala Ser Glu Gly Asp He Ala Phe Gin Gly Asn Met Leu Asn Thr 
375 380 385 

aaa cct gga ate cgc aat gee ate act gta gaa gca ggg gga gag att 
Lys Pro Gly He Arg Asn Ala He Thr Val Glu Ala Gly Gly Glu He 
390 395 400 405 

gtg tct eta tct gca caa gga ggc tea cgt ctt gta ttt tat gat ccc 
Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu Val Phe Tyr Asp Pro 
410 415 420 
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att aca cat age etc cca acc aca agt ccg tct aat aaa gac att aca 

lie Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Asp lie Thr 

425 430 435 

ate aac get aat ggc get tea gga tct gta gtc ttt aca agt aag gga 

lie Asn Ala Asn Gly Ala Ser Gly Ser Val Val Phe Thr Ser Lys Gly 

440 445 450 

etc tec tct aca gaa etc ctg ttg cct gee aac acg aca act ata ctt 

Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn Thr Thr Thr lie Leu 

455 460 465 

eta gga aca gtc aag ate get agt gga gaa ctg aag att act gac aat 

Leu Gly Thr Val Lys lie Ala Ser Gly Glu Leu Lys lie Thr Asp Asn 

470 475 480 485 

gcg gtt gtc aat gtt get ggc ttc get act cag ggc tea ggt cag ctt 

Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Gly Gin Leu 

490 495 500 

acc ctg ggc tct gga gga acc tta ggg ctg gca aca ccc acg gga gca 

Thr Leu Gly Ser Gly Gly Thr Leu Gly Leu Ala Thr Pro Thr Gly Ala 

505 510 515 

cct gec get gta gac ttt acg att gga aag tta gca ttc gat cct ttt 

Pro Ala Ala Val Asp Phe Thr lie Gly Lys Leu Ala Phe Asp Pro Phe 

520 525 530 

tec ttc eta aaa aga gat ttt gtt tea gca tea gta aat gca ggc aca 

Ser Phe Leu Lys Arg Asp Phe Val Ser Ala Ser Val Asn Ala Gly Thr 

535 540 545 

aaa aac gtc act tta aca gga get ctg gtt ctt gat gaa cat gac gtt 

Lys Asn Val Thr Leu Thr Gly Ala Leu Val Leu Asp Glu His Asp Val 

550 555 560 565 

aca gat ctt tat gat atg gtg tea tta caa tct cca gta gca att cct 

Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala lie Pro 

570 575 580 

ate get gtt ttc aaa gga gca acc gtt act aag aca gga ttt cct gat 

lie Ala Val Phe Lys Gly Ala Thr Val Thr Lys Thr Gly Phe Pro Asp 

585 590 595 

ggg gag att gcg act cca age cac tac ggc tac caa gga aag tgg tec 

Gly Glu lie Ala Thr Pro Ser His Tyr Gly Tyr Gin Gly Lys Trp Ser 

600 605 610 

tac aca tgg tec cgt ccc ctg tta att cca get cct gat gga gga ttt 

Tyr Thr Trp Ser Arg Pro Leu Leu lie Pro Ala Pro Asp Gly Gly Phe 

615 620 625 

cct gga ggt ccc tct cct age gca aat act etc tat get gta tgg aat 

Pro Gly Gly Pro Ser Pro Ser Ala Asn Thr Leu Tyr Ala Val Trp Asn 

630 635 640 645 

tea gac act etc gtg cgt tct acc tat ate tta gat ccc gag cgt tac 

Ser Asp Thr Leu Val Arg Ser Thr Tyr lie Leu Asp Pro Glu Arg Tyr 

650 655 660 
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gga gaa att gtc age aac age tta tgg att tec ttc tta gga aat cag 2130 

Gly Glu lie Val Ser Asn Ser Leu Trp lie Ser Phe Leu Gly Asn Gin 
665 670 675 

gca ttc tct gat att etc caa gat gtt ctt ttg ata gat cat ccc ggg 2178 

Ala Phe Ser Asp lie Leu Gin Asp Val Leu Leu lie Asp His Pro Gly 
680 685 690 

ttg tec ata ace gcg aaa get tta gga gee tat gtc gaa cac aca cca 2226 

Leu Ser lie Thr Ala Lys Ala Leu Gly Ala Tyr Val Glu His Thr Pro 
695 700 705 

aga caa gga cat gag ggc ttt tea ggt cgc tat gga ggc tac caa get 22 74 

Arg Gin Gly His Glu Gly Phe Ser Gly Arg Tyr Gly Gly Tyr Gin Ala 
710 715 720 725 

gcg eta tct atg aac tac acg gac cac act acg tta gga ctt tct ttc 2322 

Ala Leu Ser Met Asn Tyr Thr Asp His Thr Thr Leu Gly Leu Ser Phe 

730 735 740 

ggg cag ctt tat gga aaa act aac gec aac ccc tac gat tea cgt tgc 23 70 

Gly Gin Leu Tyr Gly Lys Thr Asn Ala Asn Pro Tyr Asp Ser Arg Cys 
745 750 755 

tea gaa caa atg tat tta etc teg ttc ttt ggt caa ttc cct ate gtg 2418 

Ser Glu Gin Met Tyr Leu Leu Ser Phe Phe Gly Gin Phe Pro lie Val 
760 765 770 

act caa aag age gag gee tta att tec tgg aaa gca get tat ggt tat 24 66 

Thr Gin Lys Ser Glu Ala Leu lie Ser Trp Lys Ala Ala Tyr Gly Tyr 
775 780 785 

tec aaa aat cac eta aat ace ace tac etc aga cct gac aaa get cca 2514 

Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg Pro Asp Lys Ala Pro 
790 795 800 805 

aaa tct caa ggg caa tgg cat aac aat agt tac tat gtt ctt att tct 2562 

Lys Ser Gin Gly' Gin Trp His Asn Asn Ser Tyr Tyr Val Leu lie Ser 

810 815 820 

gca gaa cat cct ttc eta aac tgg tgt ctt ctt aca aga cct ctg get 2610 

Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 
825 830 835 

caa get tgg gat ctt tea ggt ttt att tec gca gaa ttc eta ggt ggt 2658 

Gin Ala Trp Asp Leu Ser Gly Phe lie Ser Ala Glu Phe Leu Gly Gly 
840 845 850 

tgg caa agt aag ttc aca gaa act gga gat ctg caa cgt age ttt agt 2706 

Trp Gin Ser Lys Phe Thr Glu Thr Gly Asp Leu Gin Arg Ser Phe Ser 
855 860 865 

aga ggt aaa ggg tac aat gtt tec eta ccg ata gga tgt tct tct caa 2754 

Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro lie Gly Cys Ser Ser Gin 
870 875 880 885 

tgg ttc aca cca ttt aag aag get cct tct aca ctg ace ate aaa ctt 2802 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr lie Lys Leu 

890 895 900 
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gcc tac aag cct gat ate tat cgt gtc aac cct cac aat att gtg act 2850 

Ala Tyr Lys Pro Asp lie Tyr Arg Val Asn Pro His Asn lie Val Thr 

905 910 915 

gtc gtc tea aac caa gag age act teg ate tea gga gca aat eta cgc 2898 

Val Val Ser Asn Gin Glu Ser Thr Ser lie Ser Gly Ala Asn Leu Arg 
920 925 930 

10 

cgc cac ggt ttg ttt gta caa ate cat gat gta gta gat etc acc gag 2946 

Arg His Gly Leu Phe Val Gin lie His Asp Val Val Asp Leu Thr Glu 
935 940 945 

gac act cag gcc ttt eta aac tat acc ttt gac ggg aaa aat gga ttt 2994 

Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp Gly Lys Asn Gly Phe 
950 955 960 965 

aca aac cac cga gtg tct aca gga eta aaa tec aca ttt taaaactcta 3043 

2 0 Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 

970 975 

agetctgett agagttttct gtagccccgg tegtcttaga atcctctatc catcatcgaa 3103 
gaacttagca atgaaggeca agattctcac tctatgagaa ccccccc 3150 



<210> 26 

<211> 9 

<212> PRT 

3 0 <213> Chlamydia pneumoniae 



<220> 

<221> SITE 

<222> (936) . . (944) 



<400> 26 

Gly Leu Phe Val Gin lie His Asp Val 
1 5 



40 

<210> 27 

<211> 841 

<212> PRT 

<213> Chlamydia pneumoniae 



<400> 27 

Met Lys lie Pro Leu Arg Phe Leu Leu lie Ser Leu Val Pro Thr Leu 
15 10 15 



5 0 Ser Met Ser Asn Leu Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala 
20 25 30 

Ser Asn Ser Phe Asp Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys 
35 40 45 

Thr Ser Ser Ala Thr Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val 
50 55 60 



Val lie Glu Asn Val Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys 
60 65 70 75 80 
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Phe Lys Asn 



Phe Ser Phe 



Ala lie Gly 
115 

10 

Ser Ala Leu 
130 

Leu Gly Ala 
145 

Lys Val Leu 



2 0 Asn Cys Ala 



lie Gly Asn 
195 

Leu Thr Leu 
210 

Pro Thr Ala 
30 225 

Thr Leu Ser 



Thr lie Gly 



Thr Ser Ala 
275 

40 

Tyr Phe Tyr 
290 

Ala Leu Asn 
305 

Gly Thr lie 



5 0 Asp Glu Lys 



Asn Gly Thr 
355 

Phe Ser Gin 
370 

Leu Val Ala 
60 385 



Asp Ala Ala Ala 
85 

Thr Phe Ser Asn 
100 

Ser Glu Ala Ala 



Ser Phe Leu Lys 
135 

He Asn Val Lys 
150 

He Gin Asp Asn 
165 

Gly Ser Leu Lys 
180 

Ser Ser Ser Thr 



Ser Ser Gly Gly 
215 

Ala Gly Lys Gly 
230 

He Ser Gly Asp 
245 

Ala Thr Gly Thr 
260 

Lys He Thr Ala 



Asp Pro He Thr 
295 

He Asn Ser Pro 
310 

Val Phe Ser Gly 
325 

Asn Arg Thr Ser 
340 

Val Val Leu Lys 



Asp Ala Asn Ser 
375 

Asn Thr Glu Ser 
390 
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Gly Asp Leu Asn Phe 
90 

He Asp Ala Thr Thr 
105 

Asn Lys Thr Val Thr 
120 

Ser Pro Ala Ser Thr 
140 

Gly Asn Leu Ser Leu 
155 

Phe Ser Thr Gly Asp 
170 

He Ala Asn Asn Lys 
185 

Arg Gly Gly Ala He 
200 

Glu Thr Leu Phe Gin 
220 

Gly Ala He Ala He 
235 

Ser Gly Asp He He 
250 

Val Ser His Ser Ala 
265 

Leu Arg Ala Ala Gin 
280 

Val Thr Gly Ser Thr 
300 

Asp Thr Gly Asp Asn 
315 

Glu Lys Leu Thr Glu 
330 

Lys Leu Leu Gin Asn 
345 

Gly Asp Val Val Leu 
360 

Lys Leu He Met Asp 
380 

He Glu Leu Thr Asn 
395 



Leu Gly Gly Gly 
95 

Ala Ser Gly Ala 
110 

Leu Ser Gly Phe 
125 

Val Thr Asn Gly 



Leu Asp Asn Asp 
160 

Gly Gly Ala He 
175 

Ser Leu Ser Phe 
190 

His Thr Lys Asn 
205 

Gly Asn Thr Ala 



Ala Asp Ser Gly 
240 

Phe Glu Gly Asn 
255 

He Asp Leu Gly 
270 

Gly His Thr He 
285 

Ser Val Ala Asp 



Lys Glu Tyr Thr 
320 

Ala Glu Ala Lys 
335 

Val Ala Phe Lys 
350 

Ser Ala Asn Gly 
365 

Leu Gly Thr Ser 



Leu Glu He Asn 
400 
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lie Asp Ser Leu Arg Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr 
405 410 415 

Ala Gin Lys Asp lie Arg lie Asp Arg Pro Val Val Leu Ala lie Ser 
420 425 430 

Asp Glu Ser Phe Tyr Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr 
10 435 440 445 

Asp Gly lie Leu Glu Leu Asp Ala Gly Lys Asp lie Val lie Ser Ala 
450 455 460 

Asp Ser Arg Ser He Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly 
465 470 475 480 

Lys Trp Thr He Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser 
485 490 495 

20 

Trp Ala Lys Gin Ser Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu 
500 505 510 

Val Pro Asn Leu Leu Trp Gly Ser Phe He Asp Val Arg Ser Phe Gin 
515 520 525 

Asn Phe He Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe 
530 535 540 

3 0 Trp Val Ala Gly He Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn 
545 550 555 560 

Gin Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser 
565 570 575 

Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu 
580 585 590 

Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr 
40 595 600 605 

Tyr Ala Gly Ser Leu Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val 
610 615 620 

Val Ser He Leu Leu Gly Glu Gly Gly Leu Arg Glu He Leu Leu Pro 
625 630 635 640 

Tyr Val Ser Lys Thr Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr 
645 650 655 

50 

Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro 
660 665 670 

Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala 
675 680 685 

Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly 
690 695 700 



6 0 Phe Phe Gin Glu Tyr Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ala 
705 710 715 720 
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Arg Gin Asp Ser Phe Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser 
725 730 735 

Asp Ser His Leu Tyr Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu 
740 745 750 

Lys Arg Phe Ala Glu Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro 
10 755 760 765 

Asp Val Cys Arg Ser Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn 
770 775 780 

Gin Gly Ser Trp Lys Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly 
785 790 795 800 

lie Val Gin Ala Ser Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu 
805 810 815 

20 

Phe Gly Asn Phe Gly Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn 
820 825 830 

Val Asp Ala Gly Ser Lys lie Lys Phe 
835 840 



<210> 28 
<211> 841 

3 0 <212> PRT 

<213> Chlamydia pneumoniae 

<400> 28 

Met Lys lie Pro Leu Arg Phe Leu Leu lie Ser Leu Val Pro Thr Leu 
15 10 15 

Ser Met Ser Asn Leu Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala 
20 25 30 

4 0 Ser Asn Ser Phe Asp Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys 

35 40 45 

Thr Ser Ser Ala Thr Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val 
50 55 60 

Val He Glu Asn Val Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys 
65 70 75 80 

Phe Lys Asn Asp Ala Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly 
50 85 90 95 

Phe Ser Phe Thr Phe Ser Asn He Asp Ala Thr Thr Ala Ser Gly Ala 
100 105 110 

Ala He Gly Ser Glu Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe 
115 120 125 

Ser Ala Leu Ser Phe Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly 
130 135 140 

60 



47/83 



Leu Gly Ala lie Asn Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp 
145 150 155 160 

Lys Val Leu lie Gin Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala lie 
165 170 175 

Asn Cys Ala Gly Ser Leu Lys lie Ala Asn Asn Lys Ser Leu Ser Phe 
10 180 185 190 

lie Gly Asn Ser Ser Ser Thr Arg Gly Gly Ala lie His Thr Lys Asn 
195 200 205 

Leu Thr Leu Ser Ser Gly Gly Glu Thr Leu Phe Gin Gly Asn Thr Ala 
210 215 220 

Pro Thr Ala Ala Gly Lys Gly Gly Ala lie Ala lie Ala Asp Ser Gly 
225 230 235 240 

20 

Thr Leu Ser lie Ser Gly Asp Ser Gly Asp lie lie Phe Glu Gly Asn 
245 250 255 

Thr lie Gly Ala Thr Gly Thr Val Ser His Ser Ala He Asp Leu Gly 
260 265 270 

Thr Ser Ala Lys He Thr Ala Leu Arg Ala Ala Gin Gly His Thr He 
275 280 285 

3 0 Tyr Phe Tyr Asp Pro He Thr Val Thr Gly Ser Thr Ser Val Ala Asp 
290 295 300 

Ala Leu Asn He Asn Ser Pro Asp Thr Gly Asp Asn Lys Glu Tyr Thr 
305 310 315 320 

Gly Thr He Val Phe Ser Gly Glu Lys Leu Thr Glu Ala Glu Ala Lys 
325 330 335 

Asp Glu Lys Asn Arg Thr Ser Lys Leu Leu Gin Asn Val Ala Phe Lys 
40 340 345 350 

Asn Gly Thr Val Val Leu Lys Gly Asp Val Val Leu Ser Ala Asn Gly 
355 360 365 

Phe Ser Gin Asp Ala Asn Ser Lys Leu He Met Asp Leu Gly Thr Ser 
370 375 380 

Leu Val Ala Asn Thr Glu Ser He Glu Leu Thr Asn Leu Glu lie Asn 
385 390 395 400 

50 

He Asp Ser Leu Arg Asn Gly Lys Lys He Lys Leu Ser Ala Ala Thr 
405 410 415 

Ala Gin Lys Asp He Arg He Asp Arg Pro Val Val Leu Ala He Ser 
420 425 430 

Asp Glu Ser Phe Tyr Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr 
435 440 445 



6 0 Asp Gly He Leu Glu Leu Asp Ala Gly Lys Asp He Val He Ser Ala 
450 455 460 
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Asp Ser Arg Ser lie Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly 
465 470 475 480 

Lys Trp Thr lie Asn Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser 
485 490 495 

Trp Ala Lys Gin Ser Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu 
10 500 505 510 

Val Pro Asn Leu Leu Trp Gly Ser Phe lie Asp Val Arg Ser Phe Gin 
515 520 525 

Asn Phe lie Glu Leu Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe 
530 535 540 

Trp Val Ala Gly lie Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn 
545 550 555 560 

20 

Gin Arg Lys Phe Arg His Val Ser Gly Gly Ala Val Val Gly Ala Ser 
565 570 575 

Thr Arg Met Pro Gly Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu 
580 585 590 

Phe Ala Arg Asp Lys Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr 
595 600 605 

3 0 Tyr Ala Gly Ser Leu Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val 
610 615 620 

Val Ser lie Leu Leu Gly Glu Gly Gly Leu Arg Glu lie Leu Leu Pro 
625 630 635 640 

Tyr Val Ser Lys Thr Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr 
645 650 655 

Gly His Thr Asp His Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro 
40 660 665 670 

Pro Thr Leu Ser Thr Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala 
675 680 685 

Gly Glu Leu Gly Thr Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly 
690 695 700 

Phe Phe Gin Glu Tyr Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ala 
705 710 715 720 

50 

Arg Gin Asp Ser Phe Val Glu Leu Gly Ala lie Ser Arg Asp Phe Ser 
725 730 735 

Asp Ser His Leu Tyr Asn Leu Ala lie Pro Leu Gly lie Lys Leu Glu 
740 745 750 

Lys Arg Phe Ala Glu Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro 
755 760 765 



60 Asp Val Cys Arg Ser Asn Pro Lys 
770 775 



Cys Thr Thr Thr Leu Leu Ser Asn 
780 
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Gin Gly Ser Trp Lys Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly 

785 790 795 800 

lie Val Gin Ala Ser Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu 

805 810 815 

Phe Gly Asn Phe Gly Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn 

10 820 825 830 



Val Asp Ala Gly Ser Lys lie Lys Phe 
835 840 



<210> 29 
<211> 245 
<212> PRT 

<213> Chlamydia pneumoniae 

20 

<400> 29 

Met Pro Pro Leu Asn Ala Asp Asp Val Leu Pro Arg Asp His Leu Ser 
15 10 15 

Asp Gly Ser Phe Ser Asp Thr Tyr Pro Asp lie Thr Thr Gin Ala lie 
20 25 30 

lie Leu lie Phe Leu Ala Leu Ser Pro Phe Leu Val Met Leu Leu Thr 
35 40 45 

30 

Ser Tyr Leu Lys lie lie lie Thr Leu Val Leu Leu Arg Asn Ala Leu 
50 55 60 

Gly Val Gin Gin Thr Pro Pro Ser Gin Val Leu Asn Gly lie Ala Leu 
65 70 75 80 

lie Leu Ser lie Tyr Val Met Phe Pro Thr Gly Val Ala Met Tyr Lys 
85 90 95 

4 0 Asp Ala Arg Lys Glu lie Glu Ala Asn Thr lie Pro Gin Ser Leu Phe 
100 105 110 

Thr Ala Glu Gly Ala Glu Thr Val Phe Val Ala Leu Asn Lys Ser Lys 
115 120 125 

Glu Pro Leu Arg Ser Phe Leu lie Arg Asn Thr Pro Lys Ala Gin lie 
130 135 140 

Gin Ser Phe Tyr Lys lie Ser Gin Lys Thr Phe Pro Ser Glu lie Arg 
50 145 150 155 160 

Ala His Leu Thr Ala Ser Asp Phe Val lie lie lie Pro Ala Phe lie 
165 170 175 

Met Gly Gin lie Lys Asn Ala Phe Glu lie Gly Val Leu lie Tyr Leu 
180 185 190 

Pro Phe Phe Val lie Asp Leu Val Thr Ala Asn Val Leu Val Ala Met 
195 200 205 

60 
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Gin Met Met Met Leu Ser Pro Leu Ser lie Ser Leu Pro Leu Lys Leu 

210 215 220 

Leu Leu lie Val Met Val Asp Gly Trp Thr Leu Leu Leu Gin Gly Leu 

225 230 235 240 



Met lie Ser Phe Lys 
10 245 



<210> 30 
<211> 233 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 30 

Met Lys Phe Phe Ser Leu lie Phe Lys Asp Asp Asp Val Ser Pro Asn 
20 1 5 10 15 

Lys Lys Val Leu Ser Pro Glu Ala Phe Ser Ala Phe Leu Asp Ala Lys 
20 25 30 

Glu Leu Leu Glu Lys Thr Lys Ala Asp Ser Glu Ala Tyr Val Ala Glu 
35 40 45 

Thr Glu Gin Lys Cys Ala Gin lie Arg Gin Glu Ala Lys Asp Gin Gly 
50 55 60 

30 

Phe Lys Glu Gly Ser Glu Ser Trp Ser Lys Gin lie Ala Phe Leu Glu 
65 70 75 80 

Glu Glu Thr Lys Asn Leu Arg lie Arg Val Arg Glu Ala Leu Val Pro 
85 90 95 

Leu Ala lie Ala Ser Val Arg Lys lie lie Gly Lys Glu Leu Glu Leu 
100 105 110 

4 0 His Pro Glu Thr lie Val Ser lie lie Ser Gin Ala Leu Lys Glu Leu 
115 120 125 

Thr Gin Asn Lys His lie lie lie Ser Val Asn Pro Lys Asp Leu Pro 
130 135 140 

Leu Val Glu Lys Ser Arg Pro Glu Leu Lys Asn lie Val Glu Tyr Ala 
145 150 155 160 

Asp Ser Leu lie Leu Thr Ala Lys Pro Asp Val Thr Pro Gly Gly Cys 
50 165 170 175 

lie lie Glu Thr Glu Ala Gly lie lie Asn Ala Gin Leu Asp Val Gin 
180 185 190 

Leu Asp Ala Leu Glu Lys Ala Phe Ser Thr lie Leu Lys Ala Lys Asn 
195 200 205 

Pro Val Asp Glu Pro Ser Glu Thr Ser Ser Ser Thr Asp Ser Ser Ser 
210 215 220 

60 
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Leu Ser Asn Asp Gin Asp Lys Lys Glu 
225 230 



<210> 31 
<211> 322 
<212> PRT 
10 <213> Chlamydia pneumoniae 

<400> 31 

Met Thr Leu Leu Cys Cys Thr Ser Cys Asn Ser Arg Ser Leu lie Val 
15 10 15 

His Gly Leu Pro Gly Arg Glu Ala Asn Glu lie Val Val Leu Leu Val 
20 25 30 

Ser Lys Gly Val Ala Ala Gin Lys Leu Pro Gin Ala Ala Ala Ala Thr 
20 35 40 45 

Ala Gly Ala Ala Thr Glu Gin Met Trp Asp lie Ala Val Pro Ser Ala 
50 55 60 

Gin lie Thr Glu Ala Leu Ala lie Leu Asn Gin Ala Gly Leu Pro Arg 
65 70 75 80 

Met Lys Gly Thr Ser Leu Leu Asp Leu Phe Ala Lys Gin Gly Leu Val 
85 90 95 

30 

Pro Ser Glu Leu Gin Glu Lys lie Arg Tyr Gin Glu Gly Leu Ser Glu 
100 105 110 

Gin Met Ala Ser Thr lie Arg Lys Met Asp Gly Val Val Asp Ala Ser 
115 120 125 

Val Gin lie Ser Phe Thr Thr Glu Asn Glu Asp Asn Leu Pro Leu Thr 
130 135 140 

4 0 Ala Ser Val Tyr lie Lys His Arg Gly Val Leu Asp Asn Pro Asn Ser 
145 150 155 160 

lie Met Val Ser Lys lie Lys Arg Leu lie Ala Ser Ala Val Pro Gly 
165 170 175 

Leu Val Pro Glu Asn Val Ser Val Val Ser Asp Arg Ala Ala Tyr Ser 
180 185 190 

Asp lie Thr lie Asn Gly Pro Trp Gly Leu Thr Glu Glu lie Asp Tyr 
50 195 200 205 

Val Ser Val Trp Gly lie lie Leu Ala Lys Ser Ser Leu Thr Lys Phe 
210 215 220 

Arg Leu lie Phe Tyr Val Leu lie Leu lie Leu Phe Val lie Ser Cys 
225 230 235 240 

Gly Leu Leu Trp Val lie Trp Lys Thr His Thr Leu lie Met Thr Met 
245 250 255 

60 
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Gly Gly Thr Lys Gly Phe Phe Asn Pro Thr Pro Tyr Thr Lys Asn Ala 
260 265 270 

Leu Glu Ala Lys Lys Ala Glu Gly Ala Ala Ala Asp Lys Glu Lys Lys 
275 280 285 

Glu Asp Ala Asp Ser Gin Gly Glu Ser Lys Asn Ala Glu Thr Ser Asp 
10 290 295 300 

Lys Asp Ser Ser Asp Lys Asp Ala Pro Glu Gly Ser Asn Glu lie Glu 
305 310 315 320 



<210> 32 
<211> 226 

2 0 <212> PRT 

<213> Chlamydia pneumoniae 

<400> 32 

Met Thr lie Arg Val Arg Asn Leu Ala Tyr Ser Val Asn Lys Lys Lys 
15 10 15 

lie Leu Asp Gly Val Thr Phe Ser Leu Glu Arg Gly His lie Thr Leu 
20 25 30 

3 0 Phe Val Gly Lys Ser Gly Ser Gly Lys Thr Met lie Leu Arg Ala Leu 

35 40 45 

Ala Gly Leu Val Gin Pro Thr Gin Gly Asp lie Trp lie Glu Gly Glu 
50 55 60 

Ala Pro Ala Leu Val Phe Gin Gin Pro Glu Leu Phe Ser His Met Thr 



Val Leu Gly Asn Cys Thr His Pro Gin lie His lie Lys Gly Arg Ser 
40 85 90 95 

Thr Glu Glu Ala Arg Glu Lys Ala Phe Glu Leu Leu His Leu Leu Asp 
100 105 110 

lie Glu Glu Val Ala Lys Asn Tyr Pro Asp Gin Leu Ser Gly Gly Gin 
115 120 125 

Lys Gin Arg Val Ala lie Val Arg Ser Leu Cys Met Asp Lys His Thr 
130 135 140 

50 

Leu Leu Phe Asp Glu Pro Thr Ser Ala Leu Asp Pro Phe Ala Thr Ala 
145 150 155 160 

Ser Phe Arg His Leu Leu Glu Thr Leu Arg Asp Gin Glu Leu Thr Val 
165 170 175 

Gly Leu Thr Thr His Asp Met Gin Phe Val His Ser Cys Leu Asp Arg 
180 185 190 

6 0 lie Tyr Leu lie Asp Gin Gly Thr Val Ala Gly Val Tyr Asp Lys Arg 
195 200 205 
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Asp Gly Glu Leu Asp Ser Gly His Pro Leu Ser Lys Tyr lie His Ser 
210 215 220 

Ala Gin 

225 



10 <210> 33 
<211> 436 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 33 

Met Lys Arg Pro Phe Phe Thr Tyr Leu Cys lie lie Phe Tyr Gly Ser 
15 10 15 

Cys Ala Ser Leu Ser Leu His Ala Gly Leu Ser Phe Pro Glu Val Arg 
20 20 25 30 

Gly Ala Thr Ala Ala Val Val His Ala Asp Ser Gly Lys Val Phe Tyr 
35 40 45 

Asp Lys Asp lie Asp Ala Val lie Tyr Pro Ala Ser Met Thr Lys lie 
50 55 60 

Ala Thr Ala Leu Phe lie Leu Lys His Tyr Pro Thr Val Leu Asp Thr 
65 70 75 80 

30 

Leu lie Lys Val Lys Gin Asp Ala lie Ala Ser lie Thr Pro Gin Ala 
85 90 95 

Lys Lys Gin Ser Gly Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp 
100 105 110 

Gly Ser Thr lie Gin Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp 
115 120 125 

4 0 Leu Phe His Ala Leu Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val 
130 135 140 

Leu Ala Met Ala Cys Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu 
145 150 155 160 

Asn Phe Phe Leu Lys Glu Glu lie Gly Cys Thr His Thr His Phe Asn 
165 170 175 

Asn Pro His Gly Leu His His Pro Asn His Tyr Thr Thr Thr Arg Asp 
50 180 185 190 

Leu lie Ser lie Met Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly 
195 200 205 

Val lie Ser Thr Thr Ser Tyr Lys lie Gly Ala Thr Asn Leu His Gly 
210 215 220 

Glu Arg lie Leu Ser Pro Thr Asn Lys Leu Leu Leu Pro Gly Ser Thr 
225 230 235 240 

60 
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Tyr His Tyr Pro Pro Ala Leu Gly Gly Lys Thr Gly Thr Thr Lys Thr 
245 250 255 

Ala Gly Lys Asn Leu lie Met Ala Ala Glu Lys Asn Asn Arg Leu Leu 
260 265 270 

Val Thr lie Ala Thr Gly Tyr Ser Gly Pro Val Ser Asp Leu Tyr Gin 
10 275 280 285 

Asp Val lie Ala Leu Cys Glu Thr Val Phe Asn Glu Pro Leu Leu Arg 
290 295 300 

Lys Glu Leu Val Pro Pro Ser Asp Cys Leu Gin Leu Glu lie Ala Asn 
305 310 315 320 

Leu Gly Lys Leu Ser Cys Pro Leu Pro Glu Gly Leu Tyr Tyr Asp Phe 
325 330 335 

20 

Tyr Ala Ser Glu Asp Arg Glu Pro Leu Ser Val Ser Phe lie Ala His 
340 345 350 

Ala Asp Ala Phe Pro lie Glu Gin Gly Asp Leu Leu Gly His Trp Val 
355 360 365 

Phe Tyr Asp Asp Glu Gly Lys Lys lie Ser Ser Gin Pro Phe Tyr Ala 
370 375 380 

3 0 Pro Cys Arg Phe Glu Arg Thr lie Lys Pro Trp Lys Leu Tyr Met Lys 
385 390 395 400 

Arg Val Phe Thr Ser Tyr Arg Thr Tyr Met Ser lie Thr Met Leu Leu 
405 410 415 

Met Tyr Phe Arg lie Arg Lys His Arg Lys Tyr Lys Asn Leu Lys His 
420 425 430 



Tyr Ser Lys He 



<210> 34 
<211> 245 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 34 

Val Val His Ala Asp Ser Gly Lys Val Phe Tyr 



Asp Lys Asp He Asp Ala Val He Tyr Pro Ala Ser Met Thr Lys lie 

15 20 25 

Ala Thr Ala Leu Phe He Leu Lys His Tyr Pro Thr Val Leu Asp Thr 

30 35 40 

Leu He Lys Val Lys Gin Asp Ala He Ala Ser He Thr Pro Gin Ala 

45 50 55 
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Lys Lys Gin Ser Gly Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp 
60 65 70 75 

Gly Ser Thr lie Gin Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp 
80 85 90 

Leu Phe His Ala Leu Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val 
10 95 100 105 

Leu Ala Met Ala Cys Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu 
110 115 120 

Asn Phe Phe Leu Lys Glu Glu lie Gly Cys Thr His Thr His Phe Asn 
125 130 135 

Asn Pro His Gly Leu His His Pro Asn His Tyr Thr Thr Thr Arg Asp 
140 145 150 155 

20 

Leu lie Ser lie Met Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly 
160 165 170 

Val lie Ser Thr Thr Ser Tyr Lys lie Gly Ala Thr Asn Leu His Gly 
175 180 185 

Glu Arg lie Leu Ser Pro Thr Asn Lys Leu Leu Leu Pro Gly Ser Thr 
190 195 200 

3 0 Tyr His Tyr Pro Pro Ala Leu Gly Gly Lys Thr Gly Thr Thr Lys Thr 
205 210 215 

Ala Gly Lys Asn Leu lie Met Ala Ala Glu Lys Asn Asn Arg Leu Leu 
220 225 230 235 

Val Thr lie Ala Thr Gly Tyr Ser Gly Pro 
240 245 

40 <210> 35 
<211> 645 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 35 

Met Ala Ser Asn Pro lie Leu Gin He Glu Asp Leu Ser He Thr Leu 
15 10 15 

Ala Lys Gin Arg Gin Gin Tyr Pro lie Val Gin Ser Leu Ser Phe Thr 
50 20 25 30 

He Asn Glu Gly Gin Thr Leu Ala He He Gly Glu Ser Gly Ser Gly 
35 40 45 

Lys Ser Val Ser Ala His Ala He Leu Arg Leu Leu Pro Cys Pro Pro 
50 55 60 

Phe Ser Val Ser Gly Gin Val Asn Phe Gin Gly His Asn Leu Leu Thr 
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Ala Ser Arg Ser lie Gin Lys Lys lie lie Gly Thr Glu lie Ser Met 
85 90 95 

lie Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Val Phe Thr lie Glu 
100 105 110 

Gin Gin Phe Arg Glu lie lie His Thr His Leu Ala Leu Thr Ala Glu 
10 115 120 125 

Val Ala Lys Glu Lys Met Leu Tyr Ala Leu Glu Glu Thr Gly Phe His 
130 135 140 

Asp Pro Arg Leu Cys Leu Asn Leu Tyr Pro His Gin Leu Ser Gly Gly 
145 150 155 160 

Met Leu Gin Arg lie Cys lie Ala Met Ala Leu Leu Cys Ser Pro Lys 
165 170 175 

20 

Leu Leu lie Ala Asp Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin 
180 185 190 

Tyr Gin lie Leu Gin Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Met 
195 200 205 

Ser Leu Leu lie lie Thr His Asn Met Gly Val Val Ala Glu Thr Ala 
210 215 220 

3 0 Asp Asp Val Leu Val Leu Tyr Ala Gly Arg Met Val Glu Cys Ala Pro 
225 230 235 240 

Ala Val Gin Met Phe His Asn Pro Ser His Pro Tyr Thr Arg Asp Leu 
245 250 255 

Leu Ala Ser Arg Pro Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn 
260 265 270 

Pro lie Pro Gly Gin Pro Pro His Tyr Thr Ala Phe Pro Ser Gly Cys 
40 275 280 285 

Arg Tyr His Pro Arg Cys Ser Lys lie Leu Asn Arg Cys Ser Ala Glu 
290 295 300 

Ala Pro Glu lie Tyr Pro Val Arg Glu Gly His Lys Val Arg Val Gly 
305 310 315 320 

Cys Met Thr Thr Asn Phe Pro Gin Pro Leu lie Gin Ala Thr Ser Leu 
325 330 335 

50 

Thr Lys His Tyr Tyr Lys Arg Ser Phe Trp Phe Gin Gly Lys Thr lie 
340 345 350 

Ala Ser Arg Pro Val Asp Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg 
355 360 365 

Ala Val Gly Leu lie Gly Glu Ser Gly Ser Gly Lys Ser Thr Leu Ala 
370 375 380 



6 0 Leu Ala Leu Ala Gly Leu Leu Pro Leu Thr Ser Gly Phe Leu Thr Phe 
385 390 395 400 
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Asn Gly Thr Pro lie Lys Leu His Ser Lys His Gly Arg His Gin Leu 
405 410 415 

Arg Ser Gin Val Arg Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn 
420 425 430 

Pro Arg Lys Thr lie Leu Asp Ser Leu Gly His Ser Leu Leu Tyr His 
10 435 440 445 

Lys Leu Val Pro Lys Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu 
450 455 460 

Glu Leu Val Gly Leu Ser Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin 
465 470 475 480 

Leu Ser Gly Gly Gin Gin Gin Arg Val Ser lie Ala Arg Ala Leu Leu 
485 490 495 

20 

Gly Val Pro Gin Leu lie lie Cys Asp Glu lie Val Ser Ala Leu Asp 
500 505 510 

Leu Ser lie Gin Ala Gin lie Leu Asn Met Leu Ala Glu Leu Gin Lys 
515 520 525 

Lys Leu Ser Leu Thr Tyr Leu Phe lie Ser His Asp Leu Ala Val Val 
530 535 540 

3 0 Arg Ser Phe Cys Thr Glu Val Phe lie Met Tyr Lys Gly Gin lie Val 
545 550 555 560 

Glu Lys Gly Asn Thr Lys Arg lie Phe Ser Asp Pro Gin His Pro Tyr 
565 570 575 

Thr Arg Met Leu Leu Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg 
580 585 590 

Gin Ser Lys Pro lie Phe Gin Glu Tyr His Lys Asp Ser Glu Glu Ser 
40 595 600 605 

Cys Ser Thr Gly Cys Tyr Phe Tyr Asn Arg Cys Pro Gin Lys Gin Glu 
610 615 620 

Ala Cys Lys Ser Glu lie lie Pro Asn Gin Gly Asp Ala His His Thr 
625 630 635 640 

Tyr Arg Cys lie His 
645 

50 

<210> 36 
<211> 588 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 36 

lie Leu Gin lie Glu Asp Leu Ser lie Thr Leu 
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Ala Lys Gin Arg Gin Gin Tyr Pro lie Val Gin Ser Leu Ser Phe Thr 
15 20 25 

lie Asn Glu Gly Gin Thr Leu Ala lie lie Gly Glu Ser Gly Ser Gly 
30 35 40 

Lys Ser Val Ser Ala His Ala lie Leu Arg Leu Leu Pro Cys Pro Pro 
10 45 50 55 

Phe Ser Val Ser Gly Gin Val Asn Phe Gin Gly His Asn Leu Leu Thr 
60 65 70 75 

Ala Ser Arg Ser lie Gin Lys Lys lie lie Gly Thr Glu lie Ser Met 
80 85 90 

lie Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Val Phe Thr lie Glu 
95 100 105 

20 

Gin Gin Phe Arg Glu lie lie His Thr His Leu Ala Leu Thr Ala Glu 
110 115 120 

Val Ala Lys Glu Lys Met Leu Tyr Ala Leu Glu Glu Thr Gly Phe His 
125 130 135 

Asp Pro Arg Leu Cys Leu Asn Leu Tyr Pro His Gin Leu Ser Gly Gly 
140 145 150 155 

3 0 Met Leu Gin Arg lie Cys lie Ala Met Ala Leu Leu Cys Ser Pro Lys 
160 165 170 

Leu Leu lie Ala Asp Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin 
175 180 185 

Tyr Gin lie Leu Gin Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Met 
190 195 200 

Ser Leu Leu lie lie Thr His Asn Met Gly Val Val Ala Glu Thr Ala 
40 205 210 215 

Asp Asp Val Leu Val Leu Tyr Ala Gly Arg Met Val Glu Cys Ala Pro 
220 225 230 235 

Ala Val Gin Met Phe His Asn Pro Ser His Pro Tyr Thr Arg Asp Leu 
240 245 250 

Leu Ala Ser Arg Pro Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn 
255 260 265 

50 

Pro lie Pro Gly Gin Pro Pro His Tyr Thr Ala Phe Pro Ser Gly Cys 
270 275 280 

Arg Tyr His Pro Arg Cys Ser Lys lie Leu Asn Arg Cys Ser Ala Glu 
285 290 295 

Ala Pro Glu lie Tyr Pro Val Arg Glu Gly His Lys Val Arg Val Gly 
300 305 310 315 

60 Cys Met Thr Thr Asn Phe Pro Gin Pro Leu lie Gin Ala Thr Ser Leu 
320 325 330 



!i 9 e ID 



59/83 

Thr Lys His Tyr Tyr Lys Arg Ser Phe Trp Phe Gin Gly Lys Thr He 
335 340 345 

Ala Ser Arg Pro Val Asp Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg 
350 355 360 

Ala Val Gly Leu He Gly Glu Ser Gly Ser Gly Lys Ser Thr Leu Ala 
10 365 370 375 

Leu Ala Leu Ala Gly Leu Leu Pro Leu Thr Ser Gly Phe Leu Thr Phe 
380 385 390 395 

Asn Gly Thr Pro lie Lys Leu His Ser Lys His Gly Arg His Gin Leu 
400 405 410 

Arg Ser Gin Val Arg Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn 
415 420 425 

20 

Pro Arg Lys Thr lie Leu Asp Ser Leu Gly His Ser Leu Leu Tyr His 
430 435 440 

Lys Leu Val Pro Lys Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu 
445 450 455 

Glu Leu Val Gly Leu Ser Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin 
460 465 470 475 

3 0 Leu Ser Gly Gly Gin Gin Gin Arg Val Ser He Ala Arg Ala Leu Leu 
480 485 490 

Gly Val Pro Gin Leu He He Cys Asp Glu He Val Ser Ala Leu Asp 
495 500 505 

Leu Ser He Gin Ala Gin He Leu Asn Met Leu Ala Glu Leu Gin Lys 
510 515 520 

Lys Leu Ser Leu Thr Tyr Leu Phe He Ser His Asp Leu Ala Val Val 
40 525 530 535 

Arg Ser Phe Cys Thr Glu Val Phe He Met Tyr Lys Gly Gin He Val 
540 545 550 555 

Glu Lys Gly Asn Thr Lys Arg He Phe Ser Asp Pro Gin His Pro Tyr 
560 565 570 

Thr Arg Met Leu Leu Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg 
575 585 585 



<210> 37 
<211> 698 
<212> PRT 

<213> Chlamydia pneumoniae 



<400> 37 

6 0 Met Pro Gly He Glu Lys Ala Ala Thr Thr Val Ala Val Pro Gin Asp 
15 10 15 
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Lys Ser Glu Glu Glu Lys Val Lys Glu Arg Leu Thr Lys Arg Glu Leu 
20 25 30 

Thr Cys Glu Asp Leu Lys Asp Asn Gly Tyr Thr Val Asn Phe Glu Asp 
35 40 45 

lie Ser lie Leu Glu Leu Leu Gin Phe Val Ser Lys lie Ser Gly Thr 
10 50 55 60 

Asn Phe Val Phe Asp Ser Asn Asp Leu Gin Phe Asn Val Thr lie Val 
65 70 75 80 

Ser His Asp Pro Thr Ser Val Asp Asp Leu Ser Thr lie Leu Leu Gin 
85 90 95 

Val Leu Lys Met His Asp Leu Lys Val Val Glu Gin Gly Asn Asn Val 
100 105 110 

20 

Leu lie Tyr Arg Asn Pro His Leu Ser Lys Leu Ser Thr Val Val Thr 
115 120 125 

Asp Ser Ser Leu Lys Glu Thr Cys Glu Ala Val Val Val Thr Arg Val 
130 135 140 

Phe Arg Leu Tyr Arg Arg Gin Pro Ser Ala Ala Val Asn He He Gin 
145 150 155 160 

3 0 Pro Leu Leu Ser His Asp Ala He Val Ser Ala Ser Glu Ala Thr Arg 
165 170 175 

His Val He He Ser Asp He Ala Gly Asn Val Asp Lys Val Ser Asp 
180 185 190 

Leu Leu Ala Ala Leu Asp Cys Pro Gly Thr Ser Val Asp Met Thr Glu 
195 200 205 

Tyr Glu Val Lys Tyr Ala Asn Pro Ala Ala Leu Val Ser Tyr Cys Gin 
40 210 215 220 

Asp Val Leu Gly Thr Leu Ala Glu Asp Asp Ala Phe Gin Met Phe He 
225 230 235 240 

Gin Pro Gly Thr Asn Lys He Phe Val Val Ser Ser Pro Arg Leu Ala 
245 250 255 

Asn Lys Ala Glu Gin Leu Leu Lys Ser Leu Asp Val Pro Glu Met Ala 
260 265 270 

50 

His Thr Leu Asp Asp Pro Ala Ser Thr Ala Leu Ala Leu Gly Gly Thr 
275 280 285 

Gly Thr Thr Ser Pro Lys Ser Leu Arg Phe Phe Met Tyr Lys Leu Lys 
290 295 300 

Tyr Gin Asn Gly Glu Val He Ala Asn Ala Leu Gin Asp He Gly Tyr 
305 310 315 320 

6 0 Asn Leu Tyr Val Thr Thr Ala Met Asp Glu Asp Phe He Asn Thr Leu 
325 330 335 
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Asn Ser lie Gin Trp Leu Glu Val Asn Asn Ser lie Val lie lie Gly 
340 345 350 

Asn Gin Gly Asn Val Asp Arg Val lie Gly Leu Leu Asn Gly Leu Asp 
355 360 365 

Leu Pro Pro Lys Gin Val Tyr lie Glu Val Leu lie Leu Asp Thr Ser 
10 370 375 380 

Leu Glu Lys Ser Trp Asp Phe Gly Val Gin Trp Val Ala Leu Gly Asp 
385 390 395 400 

Glu Gin Ser Lys Val Ala Tyr Ala Ser Gly Leu Leu Asn Asn Thr Gly 
405 410 415 

He Ala Thr Pro Thr Lys Ala Thr Val Pro Pro Gly Thr Pro Asn Pro 
420 425 430 

20 

Gly Ser He Pro Leu Pro Thr Pro Gly Gin Leu Thr Gly Phe Ser Asp 
435 440 445 

Met Leu Asn Ser Ser Ser Ala Phe Gly Leu Gly lie He Gly Asn Val 
450 455 460 

Leu Ser His Lys Gly Lys Ser Phe Leu Thr Leu Gly Gly Leu Leu Ser 
465 470 475 480 

3 0 Ala Leu Asp Gin Asp Gly Asp Thr Val He Val Leu Asn Pro Arg He 
485 490 495 

Met Ala Gin Asp Thr Gin Gin Ala Ser Phe Phe Val Gly Gin Thr Val 
500 505 510 

Pro Tyr Gin Thr He Lys Tyr Tyr He Gin Glu Thr Gly Thr Val Thr 
515 520 525 

Gin Asn lie Asp Tyr Glu Asp He Gly Val Asn Leu Val Val Thr Ser 
40 530 535 540 

Thr Val Ala Pro Asn Asn Val Val Thr Leu Gin He Glu Gin Thr He 
545 550 555 560 

Ser Glu Leu His Ser Ala Ser Gly Ser Leu Thr Pro Val Thr Asp Lys 
565 570 575 

Thr Tyr Ala Ala Thr Arg Leu Gin He Pro Asp Gly Cys Phe Leu Val 
580 585 590 

50 

Met Ser Gly His He Arg Asp Lys Thr Thr Lys Val Val Ser Gly Val 
595 600 605 

Pro Leu Leu Asn Ser He Pro Leu He Arg Gly Leu Phe Ser Arg Thr 
610 615 620 

He Asp Gin Arg Gin Lys Arg Asn He Met Met Phe He Lys Pro Lys 
625 630 635 640 



60 Val He Ser Ser Phe Glu Glu Gly Thr Arg Val Thr Asn Lys Glu Gly 
645 650 655 



1 
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Tyr Arg Tyr Asn Trp Glu Ala Asp Glu Gly Ser Met Gin Val Ala Pro 
660 665 670 

Arg His Ala Pro Glu Cys Gin Gly Pro Pro Ser Leu Gin Ala Glu Ser 
675 680 685 

Asp Phe Lys lie lie Glu lie Glu Ala Gin 
690 695 



<210> 38 
<211> 547 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 38 

Met Ser Arg Lys Asp Asn Glu Val Ser Leu Ala Arg Ser lie Phe Asn 
20 1 5 10 15 

lie Leu Ser Gly Thr Phe Cys Ser Arg lie Thr Gly lie Phe Arg Glu 
20 25 30 

lie Ala Met Ala Thr Tyr Phe Gly Ala Asp Pro lie Val Ala Ala Phe 
35 40 45 

Trp Leu Gly Phe Arg Thr Val Phe Phe Leu Arg Lys lie Leu Gly Gly 
50 55 60 

30 

Leu lie Leu Glu Gin Ala Phe lie Pro His Phe Glu Phe Leu Arg Ala 
65 70 75 80 

Gin Ser Leu Asp Arg Ala Ala Phe Phe Phe Arg Arg Phe Ser Arg Leu 
85 90 95 

lie Lys Gly Ser Thr lie lie Phe Thr Leu Leu lie Glu Ala Val Leu 
100 105 110 

4 0 Trp Val Phe Phe Asn Asn Val Glu Glu Gly Thr Tyr Asp Met lie Leu 
115 120 125 

Leu Thr Met He Leu Leu Pro Cys Gly He Phe Leu Met Met Tyr Asn 
130 135 140 

Val Asn Gly Ala Leu Leu His Cys Gly Asn Lys Phe Phe Gly Val Gly 
145 150 155 160 

Leu Ala Pro Val Val Val Asn He He Trp He Phe Phe Val He Ala 
50 165 170 175 

Ala Arg His Ser Asp Pro Arg Glu Arg He He Gly Leu Ser Val Ala 
180 185 190 

Leu Val He Gly Phe Phe Phe Glu Trp Leu He Thr Val Pro Gly Val 
195 200 205 

Trp Lys Phe Leu Leu Glu Ala Lys Ser Pro Pro Gin Glu His Asp Ser 
210 215 220 

60 
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Val Arg Ala Leu Leu Ala Pro Leu Ser Leu Gly lie Leu Thr Ser Ser 
225 230 235 240 

lie Phe Gin Leu Asn Leu Leu Ser Asp lie Cys Leu Ala Arg Tyr Val 
245 250 255 

His Glu lie Gly Pro Leu Tyr Leu Met Tyr Ser Leu Lys lie Tyr Gin 
10 260 265 270 

Leu Pro lie His Leu Phe Gly Phe Gly Val Phe Thr Val Leu Leu Pro 
275 280 285 

Ala lie Ser Arg Cys Val Gin Arg Glu Asp His Glu Arg Gly Leu Lys 
290 295 300 

Leu Met Lys Phe Val Leu Thr Leu Thr Met Ser Val Met lie He Met 
305 310 315 320 

20 

Thr Ala Gly Leu Leu Leu Leu Ala Leu Pro Gly Val Arg Val Leu Tyr 
325 330 335 

Glu His Gly Leu Phe Pro Gin Ser Ala Val Tyr Ala He Val Arg Val 
340 345 350 

Leu Arg Gly Tyr Gly Ala Ser lie lie Pro Met Ala Leu Ala Pro Leu 
355 360 365 

3 0 Val Ser Val Leu Phe Tyr Ala Gin Arg Gin Tyr Ala Val Pro Leu Phe 
370 375 380 

lie Gly He Gly Thr Ala Leu Ala Asn He Val Leu Ser Leu Val Leu 
385 390 395 400 

Gly Arg Trp Val Leu Lys Asp Val Ser Gly He Ser Tyr Ala Thr Ser 
405 410 415 

He Thr Ala Trp Val Gin Leu Tyr Phe Leu Trp Tyr Tyr Ser Ser Lys 
40 420 425 430 

Arg Leu Pro Met Tyr Ser Lys Leu Leu Trp Glu Ser He Arg Arg Ser 
435 440 445 

He Lys Val Met Gly Thr Thr Met Leu Ala Cys Met He Thr Leu Gly 
450 455 460 

Leu Asn He Leu Thr Gin Thr Thr Tyr Val He Phe Leu Asn Pro Leu 
465 470 475 480 

50 

Thr Pro Leu Ala Trp Pro Leu Ser Ser He Thr Ala Gin Ala He Ala 
485 490 495 

Phe Leu Ser Glu Ser Cys He Phe Leu Ala Phe Leu Phe Gly Phe Ala 
500 505 510 

Lys Leu Leu Arg Val Glu Asp Leu He Asn Leu Ala Ser Phe Glu Tyr 
515 520 525 



6 0 Trp Arg Gly Gin Arg Gly Leu Leu Gin Arg Gin His Val Met Gin Asp 
530 535 540 
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Thr Gin Asn 
545 



<210> 39 

<211> 535 

<212> PRT 

10 <213> Chlamydia pneumoniae 

<400> 39 

Arg Lys Asp Asn Glu Val Ser Leu Ala Arg Ser lie Phe Asn 
15 10 

13 e Leu Ser Gly Thr Phe Cys Ser Arg lie Thr Gly lie Phe Arg Glu 
15 20 25 30 

lie Ala Met Ala Thr Tyr Phe Gly Ala Asp Pro lie Val Ala Ala Phe 
20 35 40 45 

Trp Leu Gly Phe Arg Thr Val Phe Phe Leu Arg Lys lie Leu Gly Gly 
50 55 60 

Leu lie Leu Glu Gin Ala Phe lie Pro His Phe Glu Phe Leu Arg Ala 
65 70 75 

Gin Ser Leu Asp Arg Ala Ala Phe Phe Phe Arg Arg Phe Ser Arg Leu 
80 85 90 

30 

He Lys Gly Ser Thr He He Phe Thr Leu Leu lie Glu Ala Val Leu 
95 100 105 * 110 

Trp Val Phe Phe Asn Asn Val Glu Glu Gly Thr Tyr Asp Met lie Leu 
115 120 125 

Leu Thr Met He Leu Leu Pro Cys Gly He Phe Leu Met Met Tyr Asn 
130 135 140 

4 0 Val Asn Gly Ala Leu Leu His Cys Gly Asn Lys Phe Phe Gly Val Gly 
145 150 155 

Leu Ala Pro Val Val Val Asn He He Trp He Phe Phe Val He Ala 
160 165 170 

Ala Arg His Ser Asp Pro Arg Glu Arg He He Gly Leu Ser Val Ala 
175 180 185 190 

Leu Val He Gly Phe Phe Phe Glu Trp Leu He Thr Val Pro Gly Val 
50 195 200 205 

Trp Lys Phe Leu Leu Glu Ala Lys Ser Pro Pro Gin Glu His Asp Ser 
210 215 220 

Val Arg Ala Leu Leu Ala Pro Leu Ser Leu Gly He Leu Thr Ser Ser 
225 230 235 

He Phe Gin Leu Asn Leu Leu Ser Asp He Cys Leu Ala Arg Tyr Val 
240 245 250 

60 
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His Glu lie Gly Pro Leu Tyr Leu Met Tyr Ser Leu Lys lie Tyr Gin 
255 260 265 270 

Leu Pro lie His Leu Phe Gly Phe Gly Val Phe Thr Val Leu Leu Pro 
275 280 285 

Ala lie Ser Arg Cys Val Gin Arg Glu Asp His Glu Arg Gly Leu Lys 
10 290 295 300 

Leu Met Lys Phe Val Leu Thr Leu Thr Met Ser Val Met lie He Met 
305 310 315 

Thr Ala Gly Leu Leu Leu Leu Ala Leu Pro Gly Val Arg Val Leu Tyr 
320 325 330 

Glu His Gly Leu Phe Pro Gin Ser Ala Val Tyr Ala He Val Arg Val 
335 340 345 350 

20 

Leu Arg Gly Tyr Gly Ala Ser He He Pro Met Ala Leu Ala Pro Leu 
355 360 365 

Val Ser Val Leu Phe Tyr Ala Gin Arg Gin Tyr Ala Val Pro Leu Phe 
370 375 380 

He Gly He Gly Thr Ala Leu Ala Asn He Val Leu Ser Leu Val Leu 
385 390 395 

3 0 Gly Arg Trp Val Leu Lys Asp Val Ser Gly He Ser Tyr Ala Thr Ser 
400 405 410 

He Thr Ala Trp Val Gin Leu Tyr Phe Leu Trp Tyr Tyr Ser Ser Lys 
415 420 425 430 

Arg Leu Pro Met Tyr Ser Lys Leu Leu Trp Glu Ser He Arg Arg Ser 
435 440 445 

He Lys Val Met Gly Thr Thr Met Leu Ala Cys Met He Thr Leu Gly 
40 450 455 460 

Leu Asn He Leu Thr Gin Thr Thr Tyr Val He Phe Leu Asn Pro Leu 
465 470 475 

Thr Pro Leu Ala Trp Pro Leu Ser Ser He Thr Ala Gin Ala He Ala 
480 485 490 

Phe Leu Ser Glu Ser Cys He Phe Leu Ala Phe Leu Phe Gly Phe Ala 
495 500 505 510 

50 

Lys Leu Leu Arg Val Glu Asp Leu He Asn Leu Ala Ser Phe Glu Tyr 
515 520 525 

Trp Arg Gly Gin Arg Gly Leu Leu Gin 
530 535 



<210> 40 

<211> 954 

60 <212> PRT 

<213> Chlamydia 



pneumoniae 
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<400> 40 

Met Lys Thr Ser Arg Asn Lys Gin Cys Lys lie Thr Asp Pro Leu Ser 
15 10 15 

Lys Ser Ser Phe Phe Val Gly Ala Leu lie Leu Gly Lys Thr Thr lie 
20 25 30 

10 Leu Leu Asn Ala Thr Pro Leu Ser Asp Tyr Phe Asp Asn Gin Ala Asn 
35 40 45 

Gin Leu Thr Thr Leu Phe Pro Leu lie Asp Thr Leu Thr Asn Met Thr 
50 55 60 

Pro Tyr Ser His Arg Ala Thr Leu Phe Gly Val Arg Asp Asp Thr Asn 
65 70 75 80 

Gin Asp lie Val Leu Asp His Gin Asn Ser lie Glu Ser Trp Phe Glu 
20 85 90 95 

Asn Phe Ser Gin Asp Gly Gly Ala Leu Ser Cys Lys Ser Leu Ala lie 
100 105 110 

Thr Asn Thr Lys Asn Gin lie Leu Phe Leu Asn Ser Phe Ala lie Lys 
115 120 125 

Arg Ala Gly Ala Met Tyr Val Asp Gly Asn Phe Asp Leu Ser Glu Asn 
130 135 140 

30 

His Gly Ser lie lie Phe Ser Gly Asn Leu Ser Phe Pro Asn Ala Ser 
145 150 155 160 

Asn Phe Ala Asp Thr Cys Thr Gly Gly Ala Val Leu Cys Ser Lys Asn 
165 170 175 

Val Thr lie Ser Lys Asn Gin Gly Thr Ala Tyr Phe lie Asn Asn Lys 
180 185 190 

4 0 Ala Lys Ser Ser Gly Gly Ala lie Gin Ala Ala lie lie Asn lie Lys 
195 200 205 

Asp Asn Thr Gly Pro Cys Leu Phe Phe Asn Asn Ala Ala Gly Gly Thr 
210 215 220 

Ala Gly Gly Ala Leu Phe Ala Asn Ala Cys Arg lie Glu Asn Asn Ser 
225 230 235 240 

Gin Pro lie Tyr Phe Leu Asn Asn Gin Ser Gly Leu Gly Gly Ala lie 
50 245 250 255 

Arg Val His Gin Glu Cys lie Leu Thr Lys Asn Thr Gly Ser Val lie 
260 265 270 

Phe Asn Asn Asn Phe Ala Met Glu Ala Asp lie Ser Ala Asn His Ser 
275 280 285 

Ser Gly Gly Ala lie Tyr Cys lie Ser Cys Ser lie Lys Asp Asn Pro 
290 295 300 

60 
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Gly lie Ala Ala Phe Asp Asn Asn Thr Ala Ala Arg Asp Gly Gly Ala 
305 310 315 320 

lie Cys Thr Gin Ser Leu Thr lie Gin Asp Ser Gly Pro Val Tyr Phe 
325 330 335 

Thr Asn Asn Gin Gly Thr Trp Gly Gly Ala lie Met Leu Arg Gin Asp 
10 340 345 350 

Gly Ala Cys Thr Leu Phe Ala Asp Gin Gly Asp lie lie Phe Tyr Asn 
355 360 365 

Asn Arg His Phe Lys Asp Thr Phe Ser Asn His Val Ser Val Asn Cys 
370 375 380 

Thr Arg Asn Val Ser Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala 
385 390 395 400 

20 

Thr Phe Tyr Asp Pro lie Leu Gin Arg Tyr Thr lie Gin Asn Ser lie 
405 410 415 

Gin Lys Phe Asn Pro Asn Pro Glu His Leu Gly Thr lie Leu Phe Ser 
420 425 430 

Ser Thr Tyr lie Pro Asp Thr Ser Thr Ser Arg Asp Asp Phe He Ser 
435 440 445 

3 0 His Phe Arg Asn His He Gly Leu Tyr Asn Gly Thr Leu Ala Leu Glu 
450 455 460 

Asp Arg Ala Glu Trp Lys Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr 
465 470 475 480 

Leu Arg Leu Gly Ser Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin 
485 490 495 

Ser Ser Ser Ser Val Gly Ser Val lie Asn lie Asn Asn Leu Ala He 
40 500 505 510 

Asn Leu Pro Ser He Leu Gly Asn Arg Val Ala Pro Lys Leu Trp He 
515 520 525 

Arg Pro Thr Gly Ser Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro He 
530 535 540 

He Asn Leu Ser Gly Pro Leu Ser Leu Leu Asp Asp Glu Asn Leu Asp 
545 550 555 560 

50 

Pro Tyr Asp Thr Ala Asp Leu Ala Gin Pro He Ala Glu Val Pro Leu 
565 570 575 

Leu Tyr Leu Leu Asp Val Thr Ala Lys His He Asn Thr Asp Asn Phe 
580 585 590 

Tyr Pro Glu Gly Leu Asn Thr Thr Gin His Tyr Gly Tyr Gin Gly Val 
595 600 605 



6 0 Trp Ser Pro Tyr Trp He Glu Thr He Thr Thr Ser Asp Thr Ser Ser 
610 615 620 
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Glu Asp Thr Val Asn Thr Leu His Arg Gin Leu Tyr Gly Asp Trp Thr 
625 630 635 640 

Pro Thr Gly Tyr Lys Val Asn Pro Glu Asn Lys Gly Asp lie Ala Leu 
645 650 655 

Ser Ala Phe Trp Gin Ser Phe His Asn Leu Phe Ala Thr Leu Arg Tyr 
10 660 665 670 

Gin Thr Gin Gin Gly Gin lie Ala Pro Thr Ala Ser Gly Glu Ala Thr 
675 680 685 

Arg Leu Phe Val His Gin Asn Ser Asn Asn Asp Ala Lys Gly Phe His 
690 695 700 

Met Glu Ala Thr Gly Tyr Ser Leu Gly Thr Thr Ser Asn Thr Ala Ser 
705 710 715 720 

20 

Asn His Ser Phe Gly Val Asn Phe Ser Gin Leu Phe Ser Asn Leu Tyr 
725 730 735 

Glu Ser His Ser Asp Asn Ser Val Ala Ser His Thr Thr Thr Val Ala 
740 745 750 

Leu Gin lie Asn Asn Pro Trp Leu Gin Glu Arg Phe Ser Thr Ser Ala 
755 760 765 

3 0 Ser Leu Ala Tyr Ser Tyr Ser Asn His His lie Lys Ala Ser Gly Tyr 
770 775 780 

Ser Gly Lys lie Gin Thr Glu Gly Lys Cys Tyr Ser Thr Thr Leu Gly 
785 790 795 800 

Ala Ala Leu Ser Cys Ser Leu Ser Leu Gin Trp Arg Ser Arg Pro Leu 
805 810 815 

His Phe Thr Pro Phe lie Gin Ala lie Ala Val Arg Ser Asn Gin Thr 
40 820 825 830 

Ala Phe Gin Glu Ser Gly Asp Lys Ala Arg Lys Phe Ser Val His Lys 
835 840 845 

Pro Leu Tyr Asn Leu Thr Val Pro Leu Gly lie Gin Ser Ala Trp Glu 
850 855 860 

Ser Lys Phe Arg Leu Pro Thr Tyr Trp Asn lie Glu Leu Ala Tyr Gin 
865 870 875 880 

50 

Pro Val Leu Tyr Gin Gin Asn Pro Glu lie Asn Val Ser Leu Glu Ser 
885 890 895 

Ser Gly Ser Ser Trp Leu Leu Ser Gly Thr Thr Leu Ala Arg Asn Ala 
900 905 910 

lie Ala Phe Lys Gly Arg Asn Gin lie Phe lie Phe Pro Lys Leu Ser 
915 920 925 

6 0 Val Phe Leu Asp Tyr Gin Gly Ser Val Ser Ser Ser Thr Thr Thr His 
930 935 940 
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Tyr Leu His Ala Gly Thr Thr Phe Lys Phe 
945 950 



<210> 41 
<211> 788 
<212> PRT 
10 <213> Chlamydia pneumoniae 

<400> 41 

Thr Gly Gly Ala Val Leu Cys Ser Lys Asn 
15 10 

Val Thr lie Ser Lys Asn Gin Gly Thr Ala Tyr Phe lie Asn Asn Lys 
15 20 25 

Ala Lys Ser Ser Gly Gly Ala lie Gin Ala Ala lie He Asn He Lys 
20 30 35 40 

Asp Asn Thr Gly Pro Cys Leu Phe Phe Asn Asn Ala Ala Gly Gly Thr 
45 50 55 

Ala Gly Gly Ala Leu Phe Ala Asn Ala Cys Arg He Glu Asn Asn Ser 
60 65 70 

Gin Pro He Tyr Phe Leu Asn Asn Gin Ser Gly Leu Gly Gly Ala He 
75 80 85 90 

30 

Arg Val His Gin Glu Cys He Leu Thr Lys Asn Thr Gly Ser Val He 
95 100 105 

Phe Asn Asn Asn Phe Ala Met Glu Ala Asp He Ser Ala Asn His Ser 
110 115 120 

Ser Gly Gly Ala He Tyr Cys He Ser Cys Ser He Lys Asp Asn Pro 
125 130 135 

4 0 Gly He Ala Ala Phe Asp Asn Asn Thr Ala Ala Arg Asp Gly Gly Ala 
140 145 150 

He Cys Thr Gin Ser Leu Thr He Gin Asp Ser Gly Pro Val Tyr Phe 
155 160 165 170 

Thr Asn Asn Gin Gly Thr Trp Gly Gly Ala He Met Leu Arg Gin Asp 
175 180 185 

Gly Ala Cys Thr Leu Phe Ala Asp Gin Gly Asp He He Phe Tyr Asn 
50 190 195 200 

Asn Arg His Phe Lys Asp Thr Phe Ser Asn His Val Ser Val Asn Cys 
205 210 215 

Thr Arg Asn Val Ser Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala 
220 225 230 

Thr Phe Tyr Asp Pro He Leu Gin Arg Tyr Thr He Gin Asn Ser He 
235 240 245 250 
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Gin Lys Phe Asn Pro Asn Pro Glu His Leu Gly Thr lie Leu Phe Ser 
255 260 265 

Ser Thr Tyr lie Pro Asp Thr Ser Thr Ser Arg Asp Asp Phe lie Ser 
270 275 280 

His Phe Arg Asn His lie Gly Leu Tyr Asn Gly Thr Leu Ala Leu Glu 
10 285 290 295 

Asp Arg Ala Glu Trp Lys Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr 
300 305 310 

Leu Arg Leu Gly Ser Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin 
315 320 325 330 

Ser Ser Ser Ser Val Gly Ser Val lie Asn lie Asn Asn Leu Ala lie 
335 340 345 

20 

Asn Leu Pro Ser lie Leu Gly Asn Arg Val Ala Pro Lys Leu Trp lie 
350 355 360 

Arg Pro Thr Gly Ser Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro lie 
365 370 375 

lie Asn Leu Ser Gly Pro Leu Ser Leu Leu Asp Asp Glu Asn Leu Asp 
380 385 390 

3 0 Pro Tyr Asp Thr Ala Asp Leu Ala Gin Pro lie Ala Glu Val Pro Leu 
395 400 405 410 

Leu Tyr Leu Leu Asp Val Thr Ala Lys His lie Asn Thr Asp Asn Phe 
415 420 425 

Tyr Pro Glu Gly Leu Asn Thr Thr Gin His Tyr Gly Tyr Gin Gly Val 
430 435 440 

Trp Ser Pro Tyr Trp lie Glu Thr lie Thr Thr Ser Asp Thr Ser Ser 
40 445 450 455 

Glu Asp Thr Val Asn Thr Leu His Arg Gin Leu Tyr Gly Asp Trp Thr 
460 465 470 

Pro Thr Gly Tyr Lys Val Asn Pro Glu Asn Lys Gly Asp lie Ala Leu 
475 480 485 490 

Ser Ala Phe Trp Gin Ser Phe His Asn Leu Phe Ala Thr Leu Arg Tyr 
495 500 505 

50 

Gin Thr Gin Gin Gly Gin lie Ala Pro Thr Ala Ser Gly Glu Ala Thr 
510 515 520 

Arg Leu Phe Val His Gin Asn Ser Asn Asn Asp Ala Lys Gly Phe His 
525 530 535 

Met Glu Ala Thr Gly Tyr Ser Leu Gly Thr Thr Ser Asn Thr Ala Ser 
540 545 550 



6 0 Asn His Ser Phe Gly Val Asn Phe Ser Gin Leu Phe Ser Asn Leu Tyr 
555 560 565 570 
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Glu Ser His Ser Asp Asn Ser Val Ala Ser His Thr Thr Thr Val Ala 
575 580 585 

Leu Gin lie Asn Asn Pro Trp Leu Gin Glu Arg Phe Ser Thr Ser Ala 
590 595 600 

Ser Leu Ala Tyr Ser Tyr Ser Asn His His lie Lys Ala Ser Gly Tyr 
10 605 610 615 

Ser Gly Lys lie Gin Thr Glu Gly Lys Cys Tyr Ser Thr Thr Leu Gly 
620 625 630 

Ala Ala Leu Ser Cys Ser Leu Ser Leu Gin Trp Arg Ser Arg Pro Leu 
635 640 645 650 

His Phe Thr Pro Phe lie Gin Ala lie Ala Val Arg Ser Asn Gin Thr 
655 660 665 

20 

Ala Phe Gin Glu Ser Gly Asp Lys Ala Arg Lys Phe Ser Val His Lys 
670 675 680 

Pro Leu Tyr Asn Leu Thr Val Pro Leu Gly lie Gin Ser Ala Trp Glu 
685 690 695 

Ser Lys Phe Arg Leu Pro Thr Tyr Trp Asn lie Glu Leu Ala Tyr Gin 
700 705 710 

3 0 Pro Val Leu Tyr Gin Gin Asn Pro Glu lie Asn Val Ser Leu Glu Ser 
715 720 725 730 

Ser Gly Ser Ser Trp Leu Leu Ser Gly Thr Thr Leu Ala Arg Asn Ala 
735 740 745 

lie Ala Phe Lys Gly Arg Asn Gin lie Phe lie Phe Pro Lys Leu Ser 
750 755 760 

Val Phe Leu Asp Tyr Gin Gly Ser Val Ser Ser Ser Thr Thr Thr His 
40 765 770 775 

Tyr Leu His Ala Gly Thr Thr Phe Lys Phe 
780 785 



<210> 42 
<211> 1000 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 42 

Met Gin Val Phe Pro Lys Val Thr Leu Ser Leu Asp Tyr Ser Ala Asp 
15 10 15 

lie Ser Ser Ser Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met 
20 25 30 



Arg Phe Leu Thr lie Ser Asp Gin Asn Arg Lys lie Lys Glu Pro Leu 
35 40 45 

60 
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Val Ser Lys 
50 

Ala Cys Met 
65 

Ser Leu Glu 

10 

Phe Pro Leu 



Thr Thr Phe 
115 

Ser Asn Tyr 
130 

20 

Gly Gly Ala 
145 

His Ala Phe 



Ala Cys Gin 



3 0 Phe Phe Ser 
195 

Arg Gly Gly 
210 

Gin Gly Thr 
225 

Ala Leu Ser 

40 

Leu Leu Phe 



Ser Glu Asn 
275 

Asn Asn Cys 
290 

50 

Ala lie Lys 

305 

Leu Ser Gly 



Thr Thr Asn 



6 0 Asn Asn Tyr 
355 



Thr Pro Pro Lys 

55 

Phe Gly Met Thr 
70 

Lys Phe Ala Leu 
85 

Leu Asp Ser Leu 
100 

Val Gly Asn Arg 



Lys Ser lie Asp 
135 

Val Ser Cys Asn 
150 

Phe Ser Lys Asn 
165 

Gly Ala Cys Thr 
180 

Asn Arg Gly Leu 



Ala lie Ala Cys 
215 

Phe Tyr Phe Val 
230 

Thr Asn Gly His 
245 

Phe Asn Asn Thr 
260 

Thr Thr lie Ser 



Gly Asn Asn Gly 
295 

Asn Asn Ser Gly 
310 

Ser lie Asn Ser 
325 

Leu Ser lie Asp 
340 

Cys lie Arg Asp 



Phe Leu Phe Tyr Leu 
60 

Pro Ala Val Tyr Ser 
75 

Glu Arg Asp Glu Glu 
90 

Ser Thr Leu Thr Gly 
105 

His Asn Ser Ser Gin 
120 

Asn lie Leu Leu Leu 
140 

Asn Phe Leu Leu Ser 
155 

Leu Ala lie Gly Thr 
170 

lie Thr Lys Asn Arg 
185 

Asn Asn Ala Ser Thr 
200 

Asn Gly Asp Phe Thr 
220 

Asn Asn Ser Val Asn 
235 

Cys Arg lie Gin Ser 
250 

Ala Pro Ser Gly Gly 
265 

Asp Asn Thr Arg Pro 
280 

Gly Ala lie Gin Thr 
300 

Ser Val lie Phe Asn 

315 

Gly Asn Gly Ser Gly 
330 

Asp Asn Pro Gly Thr 
345 

Gly Gly Ala lie Cys 
360 



Gly Asn Phe Thr 



Leu Gin Thr Asp 
80 

Phe Arg Thr Ser 
95 

Phe Ser Pro lie 
110 

Asp lie Val Leu 
125 

Trp Thr Ser Ala 



Asn Val Glu Asp 
160 

Gly Gly Ala lie 
175 

Gly Pro Leu lie 
190 

Gly Gly Glu Thr 
205 

lie Ser Gin Asn 



Asn Trp Gly Gly 
240 

Asn Arg Ala Pro 
255 

Gly Ala Leu Arg 
270 

lie Tyr Phe Lys 
285 

Ser Val Thr Val 



Asn Asn Thr Ala 
320 

Gly Ala lie Tyr 
335 

lie Leu Phe Asn 
350 

Thr Gin Phe Leu 
365 
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Thr lie Lys 
370 

Trp Gly Gly 
385 

Ala Glu Gin 

10 

Thr Phe Gly 



Gin Leu Gly 
435 

Glu His Gin 
450 

20 

Asn His Gin 
465 

Ser Asp Tyr 



Leu Arg Asn 



3 0 Tyr Lys Phe 
515 

Ser lie Ala 
530 

Ser Gin Val 
545 

Ala Lys Gly 

40 

Ala Pro Phe 



Leu Thr Leu 
595 

Leu Ser Glu 
610 

50 

Thr Ala Arg 
625 

Ala Thr Glu 



Glu Thr lie 



6 0 Leu Tyr Arg 
675 



Asn Ser Gly His 
375 

Ala Leu Met Leu 
390 

Gly Asn lie Ala 
405 

Arg Tyr Asn Ala 
420 

Ala Asn Lys Gly 



His Pro Thr Thr 
455 

Gly Thr lie Leu 
470 

Glu Asn Asn Phe 
485 

Gly Val Leu Ser 
500 

Thr Gin Lys Gly 



Thr Thr Ala Asn 
535 

lie lie Asn Asn 
550 

Lys Ala Pro Thr 
565 

Thr Glu Asp Asn 
580 

Leu Asn Glu Glu 



Pro Leu Gin Asn 
615 

His lie Asn Thr 
630 

His Tyr Gly Tyr 
645 

Thr Thr Thr Asn 
660 

Ala Leu Tyr Ala 



Val Tyr Phe Thr Asn 
380 

Leu Gin Asp Ser Thr 
395 

Phe Gin Asn Asn Glu 
410 

lie His Cys Thr Pro 
425 

Tyr Thr Thr Ala Phe 
440 

Asn Pro Leu lie Phe 
460 

Phe Ser Ser Ala Tyr 
475 

lie Ser Ser Ser Lys 
490 

lie Glu Asp Arg Ala 
505 

Gly lie Leu Lys Leu 
520 

Ser Glu Thr Pro Ser 
540 

Leu Ala lie Asn Leu 
555 

Leu Trp lie Arg Pro 
570 

Asn Pro Thr lie Thr 
585 

Asn Arg Asp Pro Tyr 
600 

lie His Leu Leu Ser 
620 

Asp Asn Phe His Pro 
635 

Gin Gly lie Trp Ser 
650 

Asn Ala Ser lie Glu 
665 

Asn Trp Thr Pro Leu 
680 



Asn Gin Gly Asn 



Cys Leu Leu Phe 
400 

Val Phe Leu Thr 
415 

Asn Ser Asn Leu 
430 

Phe Asp Pro lie 
445 

Asn Pro Asn Ala 



lie Pro Glu Ala 
480 

Asn Thr Ser Glu 
495 

Gly Trp Gin Phe 
510 

Gly His Ala Ala 
525 

Thr Ser Val Gly 



Pro Ser lie Leu 
560 

Leu Gin Ser Ser 
575 

Leu Ser Gly Pro 
590 

Asp Ser lie Asp 
605 

Leu Ser Asp Val 



Glu Ser Leu Asn 
640 

Pro Tyr Trp Val 
655 

Thr Ala Asn Thr 
670 

Gly Tyr Lys Val 
685 
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Asn Pro Glu Tyr 
690 

Phe His Thr Met 
705 

Ser Asp lie Glu 

10 

Leu Phe Val His 
740 

Gin Ser Thr Gly 
755 

Gin Lys lie Ser 
770 

20 

lie Gly Ser Ser 
785 

Tyr Val Glu Leu 



Leu Ala Tyr Gly 
820 



Gin Gly Asp Leu 
695 

Phe Ser Leu Leu 
710 

Arg Pro Phe Leu 
725 

Gin Asn Ser lie 



Tyr Ser Leu Gin 
760 

Leu Gly Phe Ala 
775 

Asn Asn Val Ser 
790 

Pro Trp Phe Gin 
805 

Tyr Gly Asp His 



Ala Thr Thr Pro 
700 

Arg Ser Tyr Asn 
715 

Glu lie Gin Gly 
730 

Pro Gly Ala Pro 
745 

Ala Ser Ser Glu 



Gin Phe Phe Thr 
780 

Ala His Asn Thr 
795 

Glu Ala Phe Ala 
810 

His Leu His Ala 
825 



Leu Trp Gin Ser 



Arg Thr Gly Asp 
720 

lie Ala Asp Gly 
735 

Gly Phe Arg lie 
750 

Thr Ser Leu His 
765 

Arg Thr Lys Glu 



Val Ser Ser Leu 
800 

Thr Ser His Ser 
815 

Tyr lie Arg His 
830 



3 0 lie Lys Asn Arg Ala Glu Gly Thr 
835 840 

Ala lie Gly Cys Ser Phe Pro Trp 
850 855 

Ser Pro Phe Val Gin Ala lie Ala 
865 870 

Glu Glu lie Gly Asp Asn Pro Arg 
40 885 



Cys Tyr Ser His Thr Leu Ala Ala 
845 

Gin Gin Lys Ser Tyr Leu His Leu 
860 

lie Arg Ser His Gin Thr Ala Phe 
875 880 

Lys Phe Val Ser Gin Lys Pro Phe 
890 895 



Tyr Asn Leu Thr Leu Pro Leu Gly lie Gin Gly Lys Trp Gin Ser Lys 
900 905 910 

Phe His Val Pro Thr Glu Trp Thr Leu Glu Leu Ser Tyr Gin Pro Val 

915 920 925 



Leu Tyr Gin Gin Asn Pro Gin lie Gly Val Thr Leu Leu Ala Ser Gly 

930 935 940 

Gly Ser Trp Asp lie Leu Gly His Asn Tyr Val Arg Asn Ala Leu Gly 

945 950 955 960 



Tyr Lys Val His 

Leu Asp Tyr Gin 
980 



Asn Gin Thr Ala 
965 

Gly Ser Val Ser 



Leu Phe Arg Ser 
970 

Ser Ser Thr Ser 
985 



Leu Asp Leu Phe 
975 

Thr His His Leu 
990 



6 0 Gin Ala Gly Ser Thr Leu Lys Phe 
995 1000 
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<210> 43 

<211> 931 

<212> PRT 

<213> Chlamydia pneumoniae 

<400> 43 

Met Leu Leu Pro Phe Thr Phe Val Leu Ala Asn Glu Gly Leu Gin Leu 
10 1 5 10 15 

Pro Leu Glu Thr Tyr lie Thr Leu Ser Pro Glu Tyr Gin Ala Ala Pro 
20 25 30 

Gin Val Gly Phe Thr His Asn Gin Asn Gin Asp Leu Ala lie Val Gly 
35 40 45 

Asn His Asn Asp Phe lie Leu Asp Tyr Lys Tyr Tyr Arg Ser Asn Gly 
50 55 60 

20 

Gly Ala Leu Thr Cys Lys Asn Leu Leu lie Ser Glu Asn lie Gly Asn 
65 70 75 80 

Val Phe Phe Glu Lys Asn Val. Cys Pro Asn Ser Gly Gly Ala lie Tyr 
85 90 95 

Ala Ala Gin Asn Cys Thr lie Ser Lys Asn Gin Asn Tyr Ala Phe Thr 
100 105 110 

3 0 Thr Asn Leu Val Ser Asp Asn Pro Thr Ala Thr Ala Gly Ser Leu Leu 
115 120 125 

Gly Gly Ala Leu Phe Ala lie Asn Cys Ser lie Thr Asn Asn Leu Gly 
130 135 140 

Gin Gly Thr Phe Val Asp Asn Leu Ala Leu Asn Lys Gly Gly Ala Leu 
145 150 155 160 

Tyr Thr Glu Thr Asn Leu Ser lie Lys Asp Asn Lys Gly Pro lie lie 
40 165 170 175 

lie Lys Gin Asn Arg Ala Leu Asn Ser Asp Ser Leu Gly Gly Gly lie 
180 185 190 

Tyr Ser Gly Asn Ser Leu Asn lie Glu Gly Asn Ser Gly Ala lie Gin 
195 200 205 

lie Thr Ser Asn Ser Ser Gly Ser Gly Gly Gly lie Phe Ser Thr Gin 
210 215 220 

50 

Thr Leu Thr lie Ser Ser Asn Lys Lys Leu lie Glu lie Ser Glu Asn 
225 230 235 240 

Ser Ala Phe Ala Asn Asn Tyr Gly Ser Asn Phe Asn Pro Gly Gly Gly 
245 250 255 

Gly Leu Thr Thr Thr Phe Cys Thr lie Leu Asn Asn Arg Glu Gly Val 
260 265 270 

6 0 Leu Phe Asn Asn Asn Gin Ser Gin Ser Asn Gly Gly Ala lie His Ala 
275 280 285 



O '9i B :$ O li f- ii '1HiS ,, Q 5 Ei! SMO iF:!: 



76/83 



Lys Ser lie 
290 

Thr Ala Thr 
305 

Asn Gly Ser 

10 

Asn Asn Thr 



lie His Ser 
355 

Tyr Arg Val 
370 

20 

Phe Pro lie 
385 

Phe Ser Gly 



Phe Ser Tyr 



3 0 Val Glu Asp 
435 

Gly Thr Leu 
450 

lie Pro Thr 
465 

Leu Asn His 

40 

Gin Ala Pro 



Thr Glu Asp 
515 

Arg Asn Ser 
530 

50 

Ser Leu Glu 
545 

Lys He Asn 



His Tyr Gly 



60 Thr He Thr 
595 



He He Lys Glu Asn 
295 

Arg Gly Gly Ala Leu 
310 

Phe He Leu Ser Ala 

325 

Ala Ser Lys His Ala 
340 

Thr Pro Asn Met Asn 
360 

Leu Phe Tyr Asp Pro 
375 

Leu Phe Asn Phe Glu 
390 

Glu His Val His Gin 
405 

Leu Arg Asn Thr Ser 
420 

Gly Ala Gly Leu Ala 
440 

Leu Leu Gly Gin Gly 
455 

Pro Ser Ser Thr Pro 
470 

He Ala He Asp Leu 
485 

Lys He Trp He Tyr 
500 

Ser Asn Pro Thr He 
520 

Asn Asn Glu Asp Pro 
535 

Lys Val Pro Leu Leu 
550 

Ser Ser Gin Leu Asp 
565 

Tyr Gin Gly He Trp 
580 

Asn Pro Thr Ser Leu 
600 



Gly Pro Val Tyr 
300 

Leu Asn Leu Ser 
315 

Asp Asn Gly Asp 
330 

Leu Asn Pro Pro 
345 

Leu Gin He Gly 



He Glu His Glu 
380 

Thr Gly His Thr 
395 

Asn Phe Thr Asp 
410 

Glu Leu Arg Gin 
425 

Cys Tyr Lys Phe 



Ala Val He Thr 
460 

Thr Thr Val Gly 

475 

Pro Ser He Leu 
490 

Pro Thr Lys Thr 
505 

Thr He Ser Gly 



Tyr Asp Ser Leu 
540 

Tyr He Val Asp 
555 

Leu Ser Thr Leu 
570 

Ser Thr Tyr Trp 
585 

Leu Gly Ala Asn 



Phe Leu Asn Asn 



Ala Gly Ser Gly 
320 

He He Phe Asn 
335 

Tyr Arg Asn Ala 
350 

Ala Arg Pro Gly 
365 

Leu Pro Ser Ser 



Gly Thr Val Leu 
400 

Glu Met Asn Phe 
415 

Gly Val Leu Ala 
430 

Phe Gin Arg Gly 
445 

Thr Ala Gly Thr 



Ser Thr He Thr 
480 

Ser Phe Gin Ala 
495 

Gly Ser Thr Tyr 
510 

Thr Leu Thr Leu 
525 

Asp Leu Ser His 



Val Ala Ala Gin 
560 

Asn Ser Gly Glu 
575 

Val Glu Thr Thr 
590 

Thr Lys His Lys 
605 
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Leu Leu Tyr 
610 

Arg Arg Gly 
625 

Ala Leu Ala 

10 

His Ala Ala 



Lys Asn Gly 
675 

Thr Thr Glu 
690 

20 

Ala Gin Phe 
705 

Ser Ser His 



Gin Arg Val 



3 0 Glu His Thr 
755 

Gly Ser Phe 
770 

Leu Pro Gin 
785 

Ala Leu Ala 

40 

His Ala Arg 



Pro Val Gly 
835 

Val Trp Leu 
850 

50 

Pro Glu Leu 
865 

Gin Ala Thr 



Thr Met Gin 



6 0 Asp lie Ser 
915 



Ala Asn Trp Ser 
615 

Glu Phe lie Thr 
630 

Gly Leu His Ser 
645 

Ser Leu Gin Gly 
660 

Phe Lys Gly Phe 



Ala Thr Ser Ser 
695 

Phe Ser Lys Ala 
710 

His Tyr Phe Ser 
725 

lie Arg Leu Ser 
740 

His Thr Met Tyr 



His Asn His Thr 
775 

Pro His Gly Glu 
790 

lie Arg Gly Asn 
805 

Glu Phe Ser Leu 
820 

lie Arg Ala Ser 



Thr Glu lie Ser 
855 

His Ser Lys Leu 
870 

Pro Val Thr Tyr 
885 

Val Phe Pro Lys 
900 

Ser Ser Thr Leu 



Pro Leu Gly Tyr 



Asn Ala Leu Trp 
635 

Leu Ser Ser Trp 
650 

lie Gly Leu Leu 
665 

Arg Ser His Met 
680 

Gin Ser Pro Asn 



Lys Glu His Glu 
715 

Gly Met Cys lie 
730 

Val Ser Leu Ala 
745 

Gin Gly Leu Leu 
760 

Leu Ala Gly Ala 



Ser Leu Gin lie 
795 

Leu Ala Ala Phe 
810 

His Arg Pro Leu 
825 

Trp Lys Asn His 
840 

Tyr Arg Ser Thr 



Leu lie Ser Gin 
875 

Asn Ala Leu Gly 
890 

Val Thr Leu Ser 
905 

Ser His Tyr Leu 
92 0 



Arg Pro His Pro Glu 
620 

Gin Ser Ala Tyr Thr 
640 

Asp Glu Glu Lys Gly 
655 

Val His Gin Lys Asp 
670 

Thr Gly Tyr Ser Ala 
685 

Phe Ser Leu Gly Phe 
700 

Ser Gin Asn Ser Thr 
720 

Ala Lys Tyr Ser Leu 
735 

Tyr Met Phe Thr Ser 
750 

Glu Gly Asn Ser Gin 
765 

Leu Ser Cys Val Phe 
780 

Tyr Pro Phe lie Thr 
800 

Gin Glu Ser Gly Asp 
815 

Thr Asp Val Ser Leu 
830 

His Arg Val Pro Leu 
845 

Leu Tyr Arg Gin Asp 
860 

Gly Thr Trp Thr Thr 
880 

lie Lys Val Lys Asn 
895 

Leu Asp Tyr Ser Ala 
910 

Asn Val Ala Ser Arg 
925 
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Met Arg Phe 
930 



<210> 44 
<211> 978 
<212> PRT 
10 <213> Chlamydia pneumoniae 

<400> 44 

Met Pro Leu Ser Phe Lys Ser Ser Ser Phe Cys Leu Leu Ala Cys Leu 
15 10 15 

Cys Ser Ala Ser Cys Ala Phe Ala Glu Thr Arg Leu Gly Gly Asn Phe 
20 25 30 

Val Pro Pro lie Thr Asn Gin Gly Glu Glu lie Leu Leu Thr Ser Asp 
20 35 40 45 

Phe Val Cys Ser Asn Phe Leu Gly Ala Ser Phe Ser Ser Ser Phe lie 
50 55 60 

Asn Ser Ser Ser Asn Leu Ser Leu Leu Gly Lys Gly Leu Ser Leu Thr 
65 70 75 80 

Phe Thr Ser Cys Gin Ala Pro Thr Asn Ser Asn Tyr Ala Leu Leu Ser 
85 90 95 

30 

Ala Ala Glu Thr Leu Thr Phe Lys Asn Phe Ser Ser lie Asn Phe Thr 
100 105 110 

Gly Asn Gin Ser Thr Gly Leu Gly Gly Leu lie Tyr Gly Lys Asp lie 
115 120 125 

Val Phe Gin Ser lie Lys Asp Leu lie Phe Thr Thr Asn Arg Val Ala 
130 135 140 

4 0 Tyr Ser Pro Ala Ser Val Thr Thr Ser Ala Thr Pro Ala lie Thr Thr 
145 150 155 160 

Val Thr Thr Gly Ala Ser Ala Leu Gin Pro Thr Asp Ser Leu Thr Val 
165 170 175 

Glu Asn lie Ser Gin Ser lie Lys Phe Phe Gly Asn Leu Ala Asn Phe 
180 185 190 

Gly Ser Ala lie Ser Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn 
50 195 200 205 

Asn Thr Ala Thr Met Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly 
210 215 220 

Gly Val lie Tyr Gly Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly 
225 230 235 240 

Cys lie lie Phe Thr Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val 
245 250 255 

60 
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Thr Pro Ser Ser Gly Thr Tyr Ala Leu Gly Ser Gly Gly Ala lie Cys 
260 265 270 

lie Pro Thr Gly Thr Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr 
275 280 285 

Phe Ser Tyr Asn Gly Thr Pro Asn Asp Ala Gly Ala lie Tyr Ala Glu 
10 290 295 300 

Thr Cys Asn lie Val Gly Asn Gin Gly Ala Leu Leu Leu Asp Ser Asn 
305 310 315 320 

Thr Ala Ala Arg Asn Gly Gly Ala lie Cys Ala Lys Val Leu Asn lie 
325 330 335 

Gin Gly Arg Gly Pro lie Glu Phe Ser Arg Asn Arg Ala Glu Lys Gly 
340 345 350 

20 

Gly Ala lie Phe lie Gly Pro Ser Val Gly Asp Pro Ala Lys Gin Thr 
355 360 365 

Ser Thr Leu Thr lie Leu Ala Ser Glu Gly Asp lie Ala Phe Gin Gly 
370 375 380 

Asn Met Leu Asn Thr Lys Pro Gly He Arg Asn Ala lie Thr Val Glu 
385 390 395 400 

3 0 Ala Gly Gly Glu He Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu 
405 410 415 

Val Phe Tyr Asp Pro He Thr His Ser Leu Pro Thr Thr Ser Pro Ser 
420 425 430 

Asn Lys Asp lie Thr lie Asn Ala Asn Gly Ala Ser Gly Ser Val Val 
435 440 445 

Phe Thr Ser Lys Gly Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn 
40 450 455 460 

Thr Thr Thr lie Leu Leu Gly Thr Val Lys lie Ala Ser Gly Glu Leu 
465 470 475 480 

Lys lie Thr Asp Asn Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin 
485 490 495 

Gly Ser Gly Gin Leu Thr Leu Gly Ser Gly Gly Thr Leu Gly Leu Ala 
500 505 510 

50 

Thr Pro Thr Gly Ala Pro Ala Ala Val Asp Phe Thr lie Gly Lys Leu 
515 520 525 

Ala Phe Asp Pro Phe Ser Phe Leu Lys Arg Asp Phe Val Ser Ala Ser 
530 535 540 

Val Asn Ala Gly Thr Lys Asn Val Thr Leu Thr Gly Ala Leu Val Leu 
545 550 555 560 



6 0 Asp Glu His Asp Val Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser 
565 570 575 
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Pro Val Ala lie Pro lie Ala Val Phe Lys Gly Ala Thr Val Thr Lys 
580 585 590 

Thr Gly Phe Pro Asp Gly Glu lie Ala Thr Pro Ser His Tyr Gly Tyr 
595 600 605 

Gin Gly Lys Trp Ser Tyr Thr Trp Ser Arg Pro Leu Leu lie Pro Ala 
10 610 615 620 

Pro Asp Gly Gly Phe Pro Gly Gly Pro Ser Pro Ser Ala Asn Thr Leu 
625 630 635 640 

Tyr Ala Val Trp Asn Ser Asp Thr Leu Val Arg Ser Thr Tyr lie Leu 
645 650 655 

Asp Pro Glu Arg Tyr Gly Glu lie Val Ser Asn Ser Leu Trp lie Ser 
660 665 670 

20 

Phe Leu Gly Asn Gin Ala Phe Ser Asp lie Leu Gin Asp Val Leu Leu 
675 680 685 

lie Asp His Pro Gly Leu Ser lie Thr Ala Lys Ala Leu Gly Ala Tyr 
690 695 700 

Val Glu His Thr Pro Arg Gin Gly His Glu Gly Phe Ser Gly Arg Tyr 
705 710 715 720 

3 0 Gly Gly Tyr Gin Ala Ala Leu Ser Met Asn Tyr Thr Asp His Thr Thr 
725 730 735 

Leu Gly Leu Ser Phe Gly Gin Leu Tyr Gly Lys Thr Asn Ala Asn Pro 
740 745 750 

Tyr Asp Ser Arg Cys Ser Glu Gin Met Tyr Leu Leu Ser Phe Phe Gly 
755 760 765 

Gin Phe Pro lie Val Thr Gin Lys Ser Glu Ala Leu lie Ser Trp Lys 
40 770 775 780 

Ala Ala Tyr Gly Tyr Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg 
785 790 795 800 

Pro Asp Lys Ala Pro Lys Ser Gin Gly Gin Trp His Asn Asn Ser Tyr 
805 810 815 

Tyr Val Leu lie Ser Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu 
820 825 830 

50 

Thr Arg Pro Leu Ala Gin Ala Trp Asp Leu Ser Gly Phe lie Ser Ala 
835 840 845 

Glu Phe Leu Gly Gly Trp Gin Ser Lys Phe Thr Glu Thr Gly Asp Leu 
850 855 860 

Gin Arg Ser Phe Ser Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro lie 
865 870 875 880 



6 0 Gly Cys Ser Ser Gin Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr 
885 890 895 
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Leu Thr lie Lys Leu Ala Tyr Lys Pro Asp lie Tyr Arg Val Asn Pro 
900 905 910 

His Asn lie Val Thr Val Val Ser Asn Gin Glu Ser Thr Ser lie Ser 
915 920 925 

Gly Ala Asn Leu Arg Arg His Gly Leu Phe Val Gin lie His Asp Val 
10 930 935 940 

Val Asp Leu Thr Glu Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp 
945 950 955 960 

Gly Lys Asn Gly Phe Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser 
965 970 975 

Thr Phe 

20 

<210> 45 
<211> 813 
<212> PRT 

<213> Chlamydia pneumoniae 
<400> 45 

Ser Ala Leu Gin Pro Thr Asp Ser Leu Thr Val 
15 10 

3 0 Glu Asn lie Ser Gin Ser lie Lys Phe Phe Gly Asn Leu Ala Asn Phe 
15 20 25 

Gly Ser Ala lie Ser Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn 
30 35 40 

Asn Thr Ala Thr Met Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly 
45 50 55 

Gly Val lie Tyr Gly Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly 
40 60 65 70 75 

Cys lie lie Phe Thr Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val 
80 85 90 

Thr Pro Ser Ser Gly Thr Tyr Ala Leu Gly Ser Gly Gly Ala lie Cys 
95 100 105 

lie Pro Thr Gly Thr Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr 
110 115 120 

50 

Phe Ser Tyr Asn Gly Thr Pro Asn Asp Ala Gly Ala lie Tyr Ala Glu 
125 130 135 

Thr Cys Asn lie Val Gly Asn Gin Gly Ala Leu Leu Leu Asp Ser Asn 
140 145 150 155 

Thr Ala Ala Arg Asn Gly Gly Ala lie Cys Ala Lys Val Leu Asn lie 
160 165 170 

6 0 Gin Gly Arg Gly Pro lie Glu Phe Ser Arg Asn Arg Ala Glu Lys Gly 
175 180 185 
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Gly Ala lie Phe lie Gly Pro Ser Val Gly Asp Pro Ala Lys Gin Thr 
190 195 200 

Ser Thr Leu Thr lie Leu Ala Ser Glu Gly Asp lie Ala Phe Gin Gly 
205 210 215 

Asn Met Leu Asn Thr Lys Pro Gly lie Arg Asn Ala lie Thr Val Glu 
10 220 225 230 235 

Ala Gly Gly Glu lie Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu 
240 245 250 

Val Phe Tyr Asp Pro lie Thr His Ser Leu Pro Thr Thr Ser Pro Ser 
255 260 265 

Asn Lys Asp lie Thr lie Asn Ala Asn Gly Ala Ser Gly Ser Val Val 
270 275 280 

20 

Phe Thr Ser Lys Gly Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn 
285 290 295 

Thr Thr Thr He Leu Leu Gly Thr Val Lys He Ala Ser Gly Glu Leu 
300 305 310 315 

Lys lie Thr Asp Asn Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin 
320 325 330 

3 0 Gly Ser Gly Gin Leu Thr Leu Gly Ser Gly Gly Thr Leu Gly Leu Ala 
335 340 345 

Thr Pro Thr Gly Ala Pro Ala Ala Val Asp Phe Thr He Gly Lys Leu 
350 355 360 

Ala Phe Asp Pro Phe Ser Phe Leu Lys Arg Asp Phe Val Ser Ala Ser 
365 370 375 

Val Asn Ala Gly Thr Lys Asn Val Thr Leu Thr Gly Ala Leu Val Leu 
40 380 385 390 395 

Asp Glu His Asp Val Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser 
400 405 410 

Pro Val Ala lie Pro He Ala Val Phe Lys Gly Ala Thr Val Thr Lys 
445 420 425 

Thr Gly Phe Pro Asp Gly Glu He Ala Thr Pro Ser His Tyr Gly Tyr 
430 435 440 

50 

Gin Gly Lys Trp Ser Tyr Thr Trp Ser Arg Pro Leu Leu He Pro Ala 
445 450 455 

Pro Asp Gly Gly Phe Pro Gly Gly Pro Ser Pro Ser Ala Asn Thr Leu 
460 465 470 475 

Tyr Ala Val Trp Asn Ser Asp Thr Leu Val Arg Ser Thr Tyr He Leu 
480 485 490 



6 0 Asp Pro Glu Arg Tyr Gly Glu He Val Ser Asn Ser Leu Trp He Ser 
495 500 505 
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Phe Leu Gly Asn Gin Ala Phe Ser Asp lie Leu Gin Asp Val Leu Leu 
510 515 520 

lie Asp His Pro Gly Leu Ser lie Thr Ala Lys Ala Leu Gly Ala Tyr 
525 530 535 

Val Glu His Thr Pro Arg Gin Gly His Glu Gly Phe Ser Gly Arg Tyr 
10 540 545 550 555 

Gly Gly Tyr Gin Ala Ala Leu Ser Met Asn Tyr Thr Asp His Thr Thr 
560 565 570 

Leu Gly Leu Ser Phe Gly Gin Leu Tyr Gly Lys Thr Asn Ala Asn Pro 
575 580 585 

Tyr Asp Ser Arg Cys Ser Glu Gin Met Tyr Leu Leu Ser Phe Phe Gly 
590 595 600 

20 

Gin Phe Pro He Val Thr Gin Lys Ser Glu Ala Leu He Ser Trp Lys 
605 610 615 

Ala Ala Tyr Gly Tyr Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg 
620 625 630 635 

Pro Asp Lys Ala Pro Lys Ser Gin Gly Gin Trp His Asn Asn Ser Tyr 
640 645 650 

3 0 Tyr Val Leu He Ser Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu 
655 660 665 

Thr Arg Pro Leu Ala Gin Ala Trp Asp Leu Ser Gly Phe He Ser Ala 
670 675 680 

Glu Phe Leu Gly Gly Trp Gin Ser Lys Phe Thr Glu Thr Gly Asp Leu 
685 690 695 

Gin Arg Ser Phe Ser Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He 
40 700 705 710 715 

Gly Cys Ser Ser Gin Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr 
720 725 730 

Leu Thr He Lys Leu Ala Tyr Lys Pro Asp lie Tyr Arg Val Asn Pro 
735 740 745 

His Asn He Val Thr Val Val Ser Asn Gin Glu Ser Thr Ser He Ser 
750 755 760 

50 

Gly Ala Asn Leu Arg Arg His Gly Leu Phe Val Gin He His Asp Val 
765 770 775 

Val Asp Leu Thr Glu Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp 
780 785 790 795 

Gly Lys Asn Gly Phe Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser 
800 805 810 



60 Thr Phe 



